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Preface 


In September — December 1996 I had the pleasure to visit the Department of 
Mathematics of the University of Hong Kong. The present book contains an 
extended version of the graduate course on selected topics of algebra which 
I gave in the first semester of 1996/97. It is based on similar courses at the 
Department of Mathematics and Informatics of the University of Sofia and 
on short cycles of lectures presented at conferences and seminars. 

The book is devoted to the combinatorial theory of polynomial algebras, 
free associative and free Lie algebras, and algebras with polynomial identities. 
It also examines the structure of automorphism groups of free and relatively 
free algebras. 

The goal of the book is to involve the reader as soon as possible in the 
research area, to make him or her able to read books and papers on the 
considered topics and, hopefully, to commence research in some of the fields 
related to the book. Following this idea, I have included some classical results 
as well as some contemporary results and methods. Some results are given 
as exercises. Each chapter contains comments and references to the current 
situation in the discussed field which will be useful for the orientation of the 
reader. 

I have tried to make the exposition accessible for graduate students with 
standard background on linear algebra and some elements of ring theory and 
group theory. Nevertheless I hope that a professional mathematician working 
in the field of algebra and other related topics also will find the book useful 
for his or her research. 


Hong Kong — Sofia Vesselin Drensky 
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Introduction 


This book is devoted to the theory of polynomial algebras, free associative 
and free Lie algebras, and algebras with polynomial identities. Presenting to 
the reader this area of mathematics, I shall try to answer two main questions: 


I. Why we study these topics? 
and 
II. How we study the objects? 


I Why We Study Free Algebras and PI-Algebras? 


Our starting point is the assumption that the classes of all commutative 
and all finite dimensional algebras are important and have nice algebraic 
properties. It is natural to try to generalize their theory. Therefore, we have 
to find some reasonably big class of algebras which enjoys the most important 
properties of commutative and of finite dimensional algebras. We shall give 
a “naive” motivation on the following example. Let R C C” be any set of 
points in the n-dimensional vector space. 


Problem 1 (i) Find all algebraic equations f(#1,...,@m) = 0 which van- 
ish on R, i.e. all polynomials f(a1,...,%m) € Clai,...,@m] such that 
f(ri,...,%m) = 0 for all r1,...,7%m € R. 
(ii) Find all solutions of the system of equations satisfied by R. Is this 
system of infinitely many equations equivalent to a finite system of equations? 
(iii) Study the properties of the polynomial algebra modulo the ideal 
generated by these equations. 
Problem 2 Describe the polynomials which vanish on “interesting” or “im- 
portant” sets. 


Problem 3 Describe the general properties of the set R C C” defined as 
the set of solutions of some system of algebraic equations. 


I think that there is no doubt that the attempt to solve the above problems 
was one of the main driving forces in the development of analytic geometry, 
linear algebra, commutative algebra and algebraic geometry. 


2 Free Algebras and PI-Algebras 


Now, instead of R C C” we consider an algebra RF over a field K and try 
to translate the above problems in the language of the new situation. 


Problem 1” (i) Find all algebraic equations f(x1,...,%m) =0 which vanish 
on R, ie. f(r1,...,%m) = 0 for all r1,...,7m € R. Answering the question 
what is an algebraic equation, leads us to the notions of a free associative 
algebra (an algebra of polynomials in noncommuting variables), a polynomial 
identity, a Pl-algebra (an algebra with polynomial identity), and a T-ideal 
(an ideal which is invariant under all endomorphisms of the free algebra). 

(ii) Find all algebras satisfying the polynomial identities of R. Is this sys- 
tem of infinitely many identities equivalent to a finite system? This problem 
gives rise to the notion of a variety of algebras and the finite basis or Specht 
problem: Is every T-ideal finitely generated as a T-ideal? 

(iii) Study the properties of the free algebra modulo the ideal generated by 
the polynomial identities of R. This algebra is called a relatively free algebra 
and has many of the universal properties of the polynomial algebra. 


Problem 2. Describe the polynomial identities which vanish on “interest- 
ing” or “important” algebras. 


Problem 3” Describe the general properties of the algebra R, if the only 
information we have is that R satisfies some polynomial identity. 


It turns out that the theory of PI-algebras is also related to other branches 
of mathematics as structure and combinatorial ring theory, the theory of finite 
dimensional division algebras, commutative and noncommutative invariant 
theory, projective geometry, etc. 


II How We Study Free and PI-Algebras? 


Reading a text book or a research article in mathematics we very often wonder 
how it was possible to discover such a complicated theorem. Every profes- 
sional mathematician knows that studying some object one very rarely starts 
with technically complex considerations. Usually the investigation is based on 
some elementary reasoning and calculations in the easiest cases. In the mo- 
ment when one has the proof in the simplest situations and the main ideas 
have been clarified enough, it is only a question of experience and a good 
mathematical background to state the result and its proof in general setup. 


I have tried to show the theory of free algebras and algebras with poly- 
nomial identities from inside. I have included some classical results as well as 
given some contemporary results and methods. Our considerations are purely 
combinatorial and do not contain any structure theory of Pl-algebras. This 
choice is based on my personal taste and on my believe that combinatorial 
ideas are easier to accept for a person not being involved before too deep in 
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algebra. I think that for a graduate course it is not necessary to present the 
theorems in the most general form. Some of the proofs are given in impor- 
tant partial cases which illustrate the main idea. The proof of the general 
case is usually a matter of technique and I believe that the reader will be 
able, if necessary, to reconstruct the complete proofs. Some results are given 
as exercises to the main text. I prefer to reserve the word problem for an open 
problem. Trying to solve these exercises, one should first read the hints. If the 
hint finishes with “see the paper or the book by ...” this means that one faces 
big difficulties and one should consult the paper or the book. Some of the 
exercises contain serious results. I apologize to my colleagues and friends that 
I have treated some of their beautiful, important and complicated theorems 
in this way. I think that partial cases of these results included as exercises do 
show the kitchen of the considered topics. It is also a good idea, even if the 
reader has succeeded in solving an exercise without any assistance, to have 
a look at the original paper and to compare the solution with the original. 
I have added some comments and references for the current situation in the 
discussed fields. Without considering these remarks as a comprehensive sur- 
vey, I hope that they will be useful for the orientation of the reader in the 
topics. 


Main Topics 


The first two chapters are introductory. They give the necessary background 
and fix the notation. Chapter 3 is devoted to the Specht problem and its 
negative solution for Lie algebras in characteristic 2. Chapter 4 deals with 
the reduction of arbitrary polynomial identities to polynomial identities of 
special form: homogeneous, multilinear and the so called proper polynomial 
identities and the relations between them. Chapter 5 contains illustrations 
on concrete examples: the polynomial identities of the Grassmann (or exte- 
rior) algebra and the algebra of upper triangular matrices as well as other 
algebras satisfying the same polynomial identities. Chapter 6 is devoted to 
commutative algebra and its applications to Pl-algebras. I have also included 
as exercises some basic theorems of classical (or commutative) and noncom- 
mutative invariant theory. Chapter 7 discusses the polynomial identities for 
matrix algebras, the Amitsur-Levitzki theorem and central polynomials. The 
next two chapters consider the general properties of Pl-algebras and show 
that, from combinatorial point of view, the Pl-algebras are close to com- 
mutative and finite dimensional algebras. We discuss the theorem of Regev 
for the codimension sequence of the polynomial identities and the Shirshov 
theorem for finitely generated Pl-algebras. The latter leads us to Gelfand- 
Kirillov dimension of finitely generated PI-algebras. Chapter 10 is devoted to 
the automorphisms of polynomial, free and relatively free algebras. Chapter 
11 deals with free Lie algebras, their bases, subalgebras and automorphisms 
and with automorphisms of relatively free Lie algebras. Chapter 12 intro- 
duces the powerful method of representation theory of groups in the study 
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of PlI-algebras. I have also included the final test which I have given to my 
students from the University of Hong Kong and hints to the test. 


Additional Readings 


Here we give a short (and incomplete) list of books which can serve for fur- 
ther reading on some of the topics. Chapters of the books by Cohn [51] and 
Kharchenko [147] contain the theory of free associative algebras and their 
automorphisms and invariants. Concerning similar problems for free Lie al- 
gebras one can read the books by Bourbaki [87], Bahturin [21], Reutenauer 
[228] or Mikhalev and Zolotykh [184]. Good sources for the theory of alge- 
bras with polynomial identities are the books by Procesi [213], Jacobson [128] 
and Rowen [231]. For Lie algebras one should read chapters of the book by 
Bahturin [21]. The book by Hanna Neumann [192] gives the approach to free 
groups and groups with identical relations (which are the analogue of PI- 
algebras). I think that although devoted to group theory, this book is very 
useful also for ring theorists. Specific topics can be found in the literature lis- 
tened in each section. We pay attention to the book by Krause and Lenagan 
[157] on Gelfand-Kirillov dimension and the book by Formanek [108] which is 
a good introduction to the polynomial identities and invariant theory of ma- 
trix algebras. Finally the want to mention the survey articles of Ufnarovski 
[253] and Bahturin and Olshanskii [23] which deal respectively with combina- 
torics of associative algebras and with parallel approach to algebraic systems 
with identical relations (including groups, associative and Lie algebras, etc.). 
As a very good starting point to the topics included in the present book as 
well as for many other topics in ring theory we recommend the two-volume 
book on ring theory by Rowen [232] or its one-volume student version [233]. 


1. Commutative, Associative and Lie 
Algebras 


Throughout the book we fix the notation AK for an arbitrary field of any 
characteristic. All considered vector spaces, algebras, modules, tensor prod- 
ucts are over K. In order to unify the notation we use different fonts in the 
following way: 


Greek: a, 8,...,w —“scalars” (elements of A) and mappings (usually homo- 
morphisms); 
Lower Case Italics: a,b,...,z — elements of vector spaces; 


Upper Case Italics: A, B,...,Z — vector spaces; 

Upper Case German: 2, B,...,3 — classes of sets; 

Upper Case Doubled: C,R,Q,Z,N — as usual, for the sets of complex, real, 
rational, integer and positive integer numbers. 


In the first chapter we introduce the main objects to study: algebras 
(commutative, associative, Lie, etc.) over a field and free algebras, and give 
their basic properties. We prove Poincaré-Birkhoff-Witt Theorem (which de- 
scribes the universal enveloping algebra of a Lie algebra) and some of its 
consequences which we use essentially in our exposition in the next chapters. 
We also introduce the Grassmann (or exterior) algebra which is one of the 
most important algebras in PI-theory. 


1.1 Basic Properties of Algebras 


Definition 1.1.1 A vector space R is called an algebra (or a K-algebra) if R 
is equipped with a binary operation * (i.e. a mapping *« : (R, R) > R), called 
multiplication, such that for any a,b,c € Rand anyae K 


(a+ b)*c=axcthbxe, 





ax(b+c)=axb+axce, 


a(a*b) = (aa) *b =a (ab). 


Usually we denote the multiplication of R by - (and write ab instead of 
a-b), by x, etc. Clearly, the notion of algebra generalizes both the notion of 
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vector space and of ring. Initially we do not require 1 € R, the associativity 


of R, etc. 


Remark 1.1.2 If the algebra R has a basis {e; | ¢ € JF}, then in order 
to define the multiplication in R it is sufficient to know the multiplication 
between the basis elements: 


_ k k o 
ej; ke; = ) arer, U; EK, 
kel 


where for fixed i and j only a finite number of a, are different from 0. 
Oppositely, for any given basis {e; | 7 € I} of the vector space R and a given 
system of elements ak, € K with the property that for fixed 7,7 only a finite 


number of ak, are not 0, we can define the multiplication in R by 


(Xe) * yo ne; = > Einj (ei * ej), 1 Ej = Daten. 


tel gel aged kel 


Definition 1.1.3 The subspace S$ of the algebra R is called a subalgebra if it 
is closed with respect to the multiplication, 1.e. 51,52 € S implies 51 *s_ € S. 
The subalgebra J of R is called a left ideal of Rif RI CI (ie. r*t € J for 
allr € R, i € J). Similarly one defines a right ideal and a two-sided ideal (or 
simply an ideal) (= left + right ideal in the same time, notation J < R). 


Exercise 1.1.4 Show that the following sets are K-algebras: 

(i) Z — any extension of the base field K with the ordinary operations; 

(ii) K[#], K[#1,...,€m] — the polynomials in one or several (commuting) 
variables; K[a1, #2, ...]— the polynomial algebra in countably many variables; 

(iii) M,(A) — the set of all n x n matrices with entries from A, with 
multiplication the usual multiplication of matrices; the set Endx(V) of all 
linear operators of a vector space V with the ordinary operations; 

(iv) Up (AK) — the subset of M, (4) consisting of all upper triangular ma- 
trices, with the usual multiplication; 

(v) sl,(A) — the set of n x n matrices with trace zero and with multipli- 
cation 

[r1, re] = rire — eri, 71,72 € sly (K); 


[71,2] is called the commutator of r; and ro; 
(vi) S, (A) — the set of all symmetric n x n matrices with entries from K 
and with multiplication 


81 0 S29 = 8182 + 8281; 


O,(K) — the set of all skew-symmetric n x n matrices with multiplication 
[ra, ro] : 
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(vii) The vector space KG with basis {g | g € G} where G is a finite 
group. The multiplication in AG is given by 


SS agg (x ast) = So agbngh, ag, Br EK. 


geG hEG g,heG 


Here gh is the product of g and h in G. The algebra AG ' is called the group 
algebra of the group G. 


Hint. (iv) Show that U,,(K) is a subalgebra of M,,(K). 

(v) Use that tr(rire) = tr(reri). 

(vi) If r* denotes the transpose of the matrix r, show that s{ = 51, 85 = se 
implies (81 082)’ = 8,089, and ai = —a,, a4 = —az gives [a1, a2] = —[a1, aa]. 


Exercise 1.1.5 Let char = 0. Which of the algebras in Exercise 1.1.4 have 
nontrivial left ideals and which have nontrivial two-sided ideals? 


Answer. The algebras in (ii), (iii) for mn > 1 and dimV > 1, (iv) for n > 
1, (vii) for |G] > 1 possess nontrivial left ideals but in the case (ili) the 
algebra has trivial two-sided ideals only. In order to handle (vii), prove that 
{0 lgeq 9 | @ € K} is an ideal of KG. 


Definition 1.1.6 The vector space homomorphism ¢ : Ry —> Rz of the 
algebras Ry, Re is an (algebra) homomorphism if 


(a * b) = d(a) * o(b), a,b € Ry. 


Similarly one introduces the notion of isomorphism, automorphism (= iso- 
morphism of R and R), endomorphism (= homomorphism from R to R), 
etc. 


The usual theorems concerning homomorphisms of vector spaces, groups 
and rings hold also for algebras. For example: 


Theorem 1.1.7 Let 6: Ri —> Ry be a homomorphism (of algebras). Then 
the kernel of @ 

Ker(¢) = {r € R, | o(r) = 0} 
is a two-sided ideal of Ry and the factor algebra R,/Ker(¢) is isomorphic to 
the image Im(¢) = {¢(r) |r € Ri} of ¢. 


Definition 1.1.8 Let R be an algebra over K. 
(i) R is associative if (a * b) c= ax (b*c) for every a,b,c € R; 
(ii) Ris commutative if a*b = b*a, a,b © R; 
(iii) Ris a Lie algebra if for every a,b,c € R, 
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axa=0, the anticommutative law, 
(axb)ece+ (bec) *a+(cxa)*b=0, the Jacobi identity; 
(iv) Ris unitary (or unital) if R has a unity e with the property e*r = 


ree=r,reER. 


Very often we shall consider unitary algebras. In this case we make the 
convention that all subalgebras are also unitary with the same unity as the 
algebra. From this point of view, the nontrivial ideals of a unitary algebra 
are not subalgebras. 


Exercise 1.1.9 (i) Show that the anticommutative law implies 
axb=—(bxa),a,be R. 
(ii) If chark 4 2, show that the law 
axa=0,aeER, 
is equivalent to the law 


axb=—(bxa),a,be R. 


Hint. (i) Use that 
0O= (a+ 6) «(a+ b)=axatbxbt (axb+bxa). 


(ii) If a * 6 = —(b* a), then for a = b we obtain 2(a* a) = 0. 


Exercise 1.1.10 (i) Which of the algebras in Exercise 1.1.4 are associative, 
commutative, commutative-associative, unitary, Lie? 

(ii) If R is an associative algebra (with multiplication -), show that R is 
a Lie algebra with respect to the new multiplication [r),r2] = rire — rari, 
r1,r2 € R. We denote this Lie algebra by R&). 

(iii) Show that the three-dimensional real vector space IR? is a Lie algebra 
with respect to the vector multiplication a x b, a,b € R*. 


Exercise 1.1.11 Let & be an algebra and let the direct sum of vector spaces 
R, = K @R be equipped with multiplication 





(ay + r1)(ag +2) = aya + (aire + Aer + 1r2), 1,02 € K, rire € R. 


Show that R, is a unitary algebra. (One says that R, is obtained from R by 
formal adjoint of unity.) 


Convention 1.1.12 Jf not explicitly stated, we always assume that the com- 
mutative associative algebras and the associative algebras are unitary. 
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Exercise 1.1.13 Show that there exist commutative algebras which are not 
associative. 


Definition 1.1.14 Let V and W be vector spaces with bases {v,; | 7 € J} 
and {w; | j € J}, respectively. The tensor product V@W=V x W of V 
and W is the vector space with basis {v; @w,; |i € I,j7 € J}. We assume that 


(= on] &® S > Bj w; = SOS ai Bj (0; ® w;), a3, Bj Ek. 


tel jet iel jes 


For the general definition of the tensor product of modules and its universal 
property see e.g. the book by Lang [161]. If V and W are algebras, then 
V ® W is also an algebra with multiplication 


(v’ Q w')(v" Q w) = (v'v") Q (w'w""), vy E€ V, ww" E€ W. 


Exercise 1.1.15 Show the isomorphism of algebras 


Klei,..-,@m] = Klai,..-,@m—1] @ K [am]. 


Exercise 1.1.16 Show that the tensor product of associative algebras is also 
an associative algebra. 


1.2 Free Algebras 


Definition 1.2.1 Let % be a class of algebras and let F' € U be an algebra 
generated by a set X. The algebra F is called a free algebra in the class 
U, freely generated by the set X, if for any algebra R € UW, every mapping 
X — Rcan be extended to a homomorphism F —> R. The cardinality |X| 
of the set X is called the rank of F. 


Example 1.2.2 For any set X the polynomial algebra KX] is free in the 
class of all unitary commutative associative algebras. 


The proof of the following assertion is an easy exercise. 


Proposition 1.2.3 For every set X the algebra K(X) with basis the set of 
all words 
ji, -..%%,, 01, EX, n=0,1,2,..., 


1 


and multiplication defined by 


(xi, a 5, (€5, a 25) = Ui, Bi, Fj, - + Bj, Ci EH, E X, 
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ts free in the class of all unitary associative algebras. If we consider the sub- 
space of K(X) spanned by the words of length > 1, we obtain the free nonuni- 
tary associative algebra, which is free in the class of all associative algebras. 


Exercise 1.2.4 Let K{X} be the vector space with basis the set of all nonas- 
sociative words, 1.e. words of the form 


(aj, ..)(...a3,), ta, EX, 


where the parentheses are distributed in an arbitrary way. The multiplication 
in K{X} is given by u-v = (u)(v) for any two words u,v. (More precisely, 
we omit the extra parentheses and, for example, write x; -«; = x;“; instead 
of (x;)(a;), a; -u = a;(u) instead of (x;)(u), etc.) Show that this algebra is 
free in the class of all unitary algebras. It is called the absolutely free algebra. 
Since K(X) and K{X} are generalizations of the polynomial algebra ALX], 
we also call their elements polynomials (e.g. in noncommuting variables in 


the case of A(X)). 


Exercise 1.2.5 Prove that the rank of K[X], K(X) and K{X} is an in- 
variant of the algebra, i.e. each of the isomorphisms K[X] = A[Y], K(X) = 
K(Y), K{X} = K{Y} is equivalent to |X| = |Y]. 


Hint. Let w(K(X)) be the ideal of K(X) generated by X, w°(K(X)) = 
K(X). Then 


dimu* (K(X))/w*** K(X)) = |X|*¥, k= 0,1,2,... 


Let @: K(X) —> K(Y) be an isomorphism such that ¢(a#;) = fi(yi,---; ni); 
x; € X. Then the composition 7 = ¢0 6, where 


0: K(X) — K(X), O(a) = 2 — fil0,...,0), a ©, 
is also an isomorphism and 
U(w(K(X))) = w(K), 
dimw(K(X)) /u?(K(X)) = dime(K(Y))/w?(K(Y)), X= [VL 
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1.3 The Poincaré-Birkhoff-Witt Theorem 


Definition 1.3.1 If R is an associative algebra and the Lie algebra G is 
isomorphic to a subalgebra of R‘~), we say that R is an enveloping algebra 
of G. The associative algebra U = U(G) is the universal enveloping algebra 
of the Lie algebra G, if G is a subalgebra of U‘~) and U has the following 
universal property: For any associative algebra R and any homomorphism 
of Lie algebras ¢ : G —> R‘~) there exists a unique homomorphism of 
associative algebras ¢) : U —+ R which extends ¢, 1. w is equal to ¢ on G. 


Poincaré-Birkhoff-Witt Theorem 1.3.2 Every Lie algebra G possesses a 
unique (up to an isomorphism) universal enveloping algebra U(G). If G has 
a basis {e; |4 € I}, and the set of indices I is ordered, then U(G) has a basis 


Ci e+ Cigy 1 Se Sty, te EF, p= 0,1,2,.... 


Proof. Let the multiplication of G be given by 
ee xX era So abien, age. 
kel 


We consider the free associative algebra A(X), where X = {x; | i € I} and 
its ideal J generated by all elements 


[wij] -— So ofan, i, 7 €L 
kel 
Let U = K(X)/J and let y = a; + J,¢€ I. 


Proof of the Universal Property. Let 1 : G —+ U‘) be the vector space 
homomorphism defined by 


LS) Bier + > Bini, O EK. 
ier ier 


Clearly, ¢ is also a Lie algebra homomorphism because for any e; x e; (and 
by linearity for the product of any two elements of G), 


u(e; X €;) = (x ober] = So afje(ex) = 


kel kel 
= Sotiye = [yeu] = (ler), e(es)]. 
kel 


Let R be an associative algebra and let ¢ : G —> R() be any Lie algebra 
homomorphism. Let ¢(e;) = r; € R, it € I. We define a homomorphism 
0: K(X) — R by 0(#;) = 1, ¢ € T. Since 
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[ri, 75] = [d(e:), O(e;)] = Ole: x e;) = Si afo( (en) = ire, 


kel kel 


we obtain that 
[x;, xj] -— S- at eR € Keré, J C Keré, 
kel 
and we can define # : A(X)/J —> R such that ¢ = #04. Prove the unique- 
ness for exercise! 


Proof of the Embedding. We shall show that « is an embedding of G into U. 
Let yj, ...yj, be any word (i.e. a monomial with coefficient 1) in U. If j > 4, 
using the relations 
YPYi = YiYg + So akiun, 
kel 

we can express yj, ..-Yj, a8 a linear combination of words y;,...y;, with 
a <...< tp and p < q. It is sufficient to show that these elements y%, ... yi, 
are linearly independent in U. 

In the vector space K(Z), 7 = {z; | ¢ © I}, we define linear operators 
called reductions in the following way. For a fixed word 


U = 42; 2b = By... 2 252, Zh J > 4; 


the reduction replaces u by 


a( 2.2; + S- ap )b 


kel 


and on all other words it acts identically. Clearly, for any f € A(Z), there 
exists a finite sequence of reductions which brings f to a linear combination 
of words zp, ..-2,, #1 <... < ky, and we call this linear combination the 


reduced form of f. The crucial moment of the proof is the following lemma. 
Lemma 1.3.3 The reduced form of f © K(Z) is unique. 


Proof of Lemma 1.3.8. We order the monomials of K(Z) comparing them 
first by degree and then lexicographically: 


Zi, ++ Zim > P2j,-.-2j,, IF a BER, 


ifeither m > norm =nand t= j1,...,%s = Js, ts¢1 > Jsqi for somes < m. 
This is the so called deg-lex ordering of the monomials in K(Z). Applying a 
reduction p to a monomial w, either p(u) = u or the leading term of p(u) is 
smaller than u. We apply induction on the deg-lex ordering, assuming that 
if the leading term of g € A(Z) is smaller than the leading term of f, then 
g has a unique reduced form. The base of the induction is for f being linear 
when we cannot apply any nontrivial reductions to f and it coincides with 
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its reduced form. The idea of the proof is to show that for any two reductions 
p and o, there exist reductions p,,...,p, and o1,...,0; such that 


(p10... 9 pe)(p(f)) = (719-2 o)(o(F)) 


and then to apply the inductive arguments. It is easy to see that the only 
difficulties are in the case 


p(232221) = (= + S- oe] 1, 


kel 


I 
o(232221) = 23 (as + ) oa) . 


lel 

We define the reductions as follows: p, acts on z2z3z, and changes the places 
of 21, 23: 

Pl * 222321 > 292123 7... 

similarly 

P2 : 222123 + 212923 7... 


01 : 232129 4 21232 +... 





09: 212329 > 212923 +.... 
Direct calculations give 


(p2 © pi © p) (232221) = (p20 pi)(222321 + S_ afoznz1) = 
k& 


m k& 
= po(z221%3) + y 031 222m + y 392k 21 = 
m k& 


_ t m k 
= 2122923 + ) 512123 + ) 31 29%m + ) 0392k21, 
l m k 





(02001 00)(232921) = (02 0 01) (232122 + S © a5, 2321) = 
I 
= 02(212322) + SS a3] 2m 22 + So ab) 2321 = 
m l 


k I 
= 212923 4+ ) A39212k + ) a3) 2m 22 + ) A514 2321. 
l m l 


Now we apply further reductions, if necessary, in order to present all z;2;, 
j >t, as az; + linear terms. In this way r = (po © p1 © p) (232221) and s = 
(02001 00)(232221) are reduced to linear combinations of 21 2223, 2:2;, 21. The 
coefficient of 212223 in the reduced form of r and s is equal to 1, the coefficients 
of z;z; in the reduced form of r and s are also the same. Finally, writing 
down the Jacobi identity for e1,¢2,e3 together with the anticommutativity 
(af, = —ak.), we get some relations between the products afial which give 
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that the coefficients of z; are also equal (check it!). In this way the reduced 
forms of r and s are the same and this completes the proof of the lemma. 


Now we complete the proof of the embedding. We consider an algebra W 
with basis 
(Zi, Zip | 11S... < tp, p > OF, 
and multiplication 
(Zi, ---%i,)(2j, -.-2j,) = the reduced form of %, ...%,2j, .--2j,- 
By Lemma 1.3.3, the multiplication is associative, i.e. W is an associative 
algebra. The kernel of the canonical homomorphism 7 : K(X) > W (defined 
by a(x;) = z;) contains all [x;,2;] -— }°, af the, ie. J C Kerz, and W isa 
homomorphic image of U. Since the images 2, ...2;, Of Yi, ---Yi,>o tS -- < 
tp, are linearly independent in W, the elements y%, ...y;, are also linearly 
independent and we obtain that Kera = J, i.e. U = W and this completes 
the proof of the theorem. 


Exercise 1.3.4 (i) If G and # are Lie algebras and H is a homomorphic 
image of G, then U(H) is a homomorphic image of U(G). 

(ii) U(G, @ G2) = U(G1) @ U(G2). 

(iii) The algebra U(G) has no zero divisors. 


Hint. (iii) See e.g. the book by Bourbaki [37]. See the same book for further 
properties of the universal enveloping algebras of Lie algebras. 


Theorem 1.3.5 (Witt) The Lie subalgebra L(X) of K(X)(—) generated by 
X is isomorphic to the free Lie algebra with X as a set of free generators; 


U(L(X)) = K(X). 


Proof. Let R be any associative algebra and let ¢ : L(X) —> RO) be a 
homomorphism. The mapping ¢o : X > R defined by ¢o(x) = @(), « EX, 
induces a unique homomorphism w : A(X) —> R. Since d(x) = (x), we 
obtain that the restriction of % on L(X) is equal to ¢. Hence U(L(X)) = 
K(X). If Gis a Lie algebra and R is its enveloping algebra, then any mapping 
X > GC R induces a homomorphism K(X) —> R; its restriction on L(X) 
is a homomorphism of L(X) to R(-) which sends the generators of L(X) to 
G. Therefore the image of L(X) is in G and this gives that D(X) is free in 
the class of all Lie algebras. 


Definition 1.3.6 Let Mf be a vector space, let R be an associative algebra 
and let 
p:R— Endx(M) 


be an algebra homomorphism (such that p(1) = id). Then p is called a repre- 
sentation of R in M and M is a left R-module. Similarly one defines a right 
R-module assuming that the linear operators of M act from the right. 
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Exercise 1.3.7 Show that every left ideal of R is a left R-module, where the 
left action of r € R is given by p(r) : 5s rs, 8 € R. 


Proposition 1.3.8 Let H be a subalgebra of the Lie algebra G, and let H be 
generated by hy,...,hm. Let the right U(G)-module GU(G) be generated by 
hy,..., hm. Then G= H. 


Proof. Let H # G. We consider an arbitrary basis {h; | i © J} of H and 
extend it by a set {g; | 7 € J} C G\ 4 to a basis of G, assuming that 
hy < gj. Since the right U(G)-module GU (G) is generated by H, ifg € G\ H 
is a basis element of G, then g = g-1 € GU(G) and 


g= SO hk fis te €U(G). 


k=1 


We express each f;, as in Poincaré-Birkhoff-Witt Theorem 1.3.2: 
I= do hk Y= Pepahp, Ap. Gq. +++ Iaes Peog EK, 
k=1 Psd 


and bring g to its reduced form. Clearly, each product hghp, ...hp, is a 
linear combination of products of f,;’s, i.e. in the reduced form of g each 
summand contains some h;. This contradicts with the linear independence of 


the reduced basis of U(G'). Hence G = H. 


Remark 1.3.9 Using the same idea as in the proof of Poincaré-Birkhoff- Witt 
Theorem 1.3.2 one can develop Grobner bases techniques both for commu- 
tative and noncommutative algebras. The Grobner bases are a very powerful 
tool in computational algebra, algebraic geometry and invariant theory, see 
e.g. the books by Adams and Loustaunau [4] and Sturmfels [247]. For ap- 
plications to noncommutative algebra we forward to the paper by Bergman 
[33], the surveys by Ufnarovski [253], Mora [186] and Belov, Borisenko and 
Latyshev [27] and the lecture notes by Latyshev [167]. 


Definition 1.3.10 Let V be a vector space with ordered basis {e; | i € I}. 
(i) The Grassmann (or exterior) algebra E(V) of V is the associative 
algebra generated by {e; | i € I} and with defining relations 


eye; teje; =0, 2,9 ET, 


(and e? = 0 ifcharKk = 2). This means that E(V) is isomorphic to the factor 
algebra K(X)/J, where X = {x; | ¢ © I} and the ideal J is generated by 
“jx; + 2;x;, t,j € 1. If dimV is countable, we assume that V has a basis 
{e1, €2,...} and denote E(V) by E. 

(ii) If V is equipped with a symmetric bilinear form (, ), the Clifford 
algebra of V is generated by the basis of V with defining relations 
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e;e; + ej, = (ei, €5), igel. 


Exercise 1.3.11 In the notation of Definition 1.3.10, show that the Grass- 
mann and Clifford algebras of the vector space V have (the same) basis 


Ci, ---Ci,, 21 <M tn, te EL,n=0,1,2,.... 


Hint. Apply reductions as in the proof of Poincaré-Birkhoff-Witt Theorem 
1.3.2. 


2. Algebras with Polynomial Identities 


A significant part of the book is devoted to algebras with polynomial iden- 
tities. In this chapter we introduce Pl-algebras and the related with them 
notions of varieties of algebras and relatively free algebras. We also give 
some examples of Pl-algebras which will motivate our further study. Finally, 
we prove the theorem of Birkhoff which describes varieties of any algebraic 
systems in categorical language. 


2.1 Definitions and Examples of PI-Algebras 


From now on we fix a countable infinite set X = {x1, x2,...}. Sometimes we 
shall also use other symbols, (e.g. y, z, yi, 2;, etc.) for the elements of X. 


Definition 2.1.1 (i) Let f = f(a1,...,@n) € K(X) and let R be an asso- 
ciative algebra. We say that f = 0 is a polynomial identity for R if 


f(ri,.-.,%n) =0 forall r,...,% ER. 


Sometimes we shall also say that f itself is a polynomial identity for R. 

(ii) If the associative algebra R satisfies a nontrivial polynomial identity 
f =0 (ie. f is a nonzero element of K(X)), we call Ra Pl-algebra (“PP = 
“Polynomial Identity” ). 


Exercise 2.1.2 Show that f € K(X) is a polynomial identity for R if and 
only if f is in the kernel of all homomorphisms K(X) —> R. 


Examples 2.1.3 (i) The algebra R is commutative if and only if it satisfies 
the polynomial identity 


[ay, r9| = U1 e2 — Cok, = 0. 


(ii) Let R be a finite dimensional associative algebra and let dimR < n. 
Then R satisfies the standard identity of degree n 


Sn (@1,.-.,2n) = S- (signa)o(1)---®o(n) = 9, 
TES y 
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where S;, is the symmetric group of degree n. The algebra F also satisfies the 
Capelli identity 


dy (x1, se ey ny Y1y-- sy Yn+1) = S- (signa) yi ®o(1)Y2 on ‘Yn®a(n\Yn+1 =0. 
TES y 


Hint. (ii) Since both the standard and the Capelli polynomials are skew- 
symmetric in £1,...,%,, it is sufficient to verify that s, = 0 and d, = 0 on 
the elements of a fixed basis of R. Since dimR < n, we obtain that s, and 
d, vanish on R. 


Exercise 2.1.4 Show that the Grassmann algebra F satisfies the polynomial 
identity 
[v1,%2, 3] = [[v1, @2], v3] = 0. 


Hint. Since [a1, #2, x3] is linear in each of the variables x1, x2, v3, it is suffi- 
cient to see that [r1,72,7r3] = 0 for the basis elements of E. Check that 


mn 


[ri, 72] = [ei,..-i,,€j, ---€7,] = (L- (-1) Ci, + Ci ej, ++ -Cjns 


i.e. [71,72] # 0 implies that both m and n are odd integers. Then [r1, 72] is 
of even length. 


Definition 2.1.5 The (Lie) commutator of length n, n > 1, is defined induc- 
tively by 
[x1, x2] = a (adxz) = &1 82 — £94), 


[@i,---5@n—-1,€n] = [[e@1,---,@n-1],2n], n > 2. 


Instead of the above left normed commutators, some books (e.g. Bourbaki 
[37] and Bahturin [21]) use the right normed notation 


[w1, v2] = (ada ,)xe, [a1, 2,3] = [a1, [xo, xsl]. 


Exercise 2.1.6 Show that the tensor product E @ E satisfies the centre-by- 
metabelian identity 
[[a1, xo], [@3, aa], x5] = 0. 


Exercise 2.1.7 Let Mz(K) be the 2 x 2 matrix algebra. Show that Mo(K) 
satisfies the following polynomial identities: 

(i) The standard identity s4(#1, 72, %3,%4) = 0. 

(ii) The Hall identity [[x1, 22], x3] = 0. 

(iii) Show that Ms(.A) does not satisfy the Capelli identity d, = 0 and 
the standard identity s3 = 0. 
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Hint. (i) Fix the basis of M2(K) 
{é = €11 + €22, €11, €12, €21} 
and rewrite the standard polynomial s4 in the form 
84 = [#1, @2] © [w3, &4] + [@1, x3] © [w4, ¥2] + [@1, x4] © [@2, 3]. 


Then s4(e, €11, €12, €21) = 0 because e participates in commutators only. 
(ii) For any 2 x 2 matrix r the Cayley-Hamilton theorem gives that 


a tr(r)r + det(rje = 0. 


If r = [ri, ra], then tr(r) = 0 and r? is a scalar matrix. 
(iii) Find elements of Mz2(A) such that the Capelli polynomial d, does 
not vanish on them. Do the same for the standard polynomial s3. 


Exercise 2.1.8 Show that the n x n matrix algebra M,,(K) satisfies the 
identity of algebraicity 


dn4i(l,a,@*,..., 2" 1,y1,---,9n, 1) = 
= S- (signa)x? Oya? yy... yx?” =0, 
TES n4i 


where the symmetric group 5,41 acts on {0,1,...,2}, and the identity 


$n ([x, y], [x,y]... -5[2”, y]) = 0. 


Hint. Use the Cayley-Hamilton theorem to conclude that 1,a,2?,..., 2” are 
linearly dependent. Do the same conclusion for [x, y], [x7, y],..-,[2”, y]- 


Exercise 2.1.9 Let U, (A) be the algebra of n x n upper triangular matrices. 
Show that U,(K’) satisfies the identity 


[ay, r9| oe .[Wen—1, Lon| = 0. 


Derive from here that U;,(A’) satisfies also the standard identity of degree 
2n. (Later we shall see in Theorem 7.1.3, that M,(K) satisfies so, = 0 and 
this is the statement of the famous Amitsur-Levitzki theorem.) 


Hint. Show that [ri,r2] is an upper triangular matrix with zero diagonal, 
71,72 € U,(K), and use that the product of n such zero diagonal matrices is 
0. Then rewrite so, as 
1 . 
mn (signe) (#51), €o(2)]---[®o(2n—1); ®o(2n)]- 
TESan 
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Exercise 2.1.10 (i) Show that if R is a finite dimensional algebra over a 

finite field K, then R satisfies a nonzero polynomial identity in one variable. 
(ii) If the base field A is finite and |K| = gq, show that there exists a 

positive integer m such that M,(K) satisfies the polynomial identity 








Hint. (i) The algebra R has a finite number of elements (why?). For a fixed 


r € R, at least two of the elements r, r?, r°,... coincide. Hence there exist two 


different positive integers k and / such that the polynomial f,(#) = #* — a! 


satisfies f,(r) = 0 in R. Since R is a finite set, the polynomial 


ge) = [J fe) © K(X) 


reR 


is well defined and is a polynomial identity for R. 

(ii) Since M,,(4) is a finite set, there exists a finite extension Fym of 
the base field AK = IF, which contains the eigenvalues of all matrices r in 
M,,(). Consider r written in its Jordan normal form (over the field Fm). 
Then r?” —r is an upper triangular matrix and its n-th power is equal to 0. 


Remark 2.1.11 Some important properties of associative algebras are ex- 
pressed in the language of polynomial identities. We have seen this for the 
commutativity. Other examples come from nonunitary algebras. The algebra 
Ris nil of bounded index if there exists an n € N such that 2” = 0 is an 
identity for R; the algebra R is nilpotent of class <n if x1...%, = 0 for R. 


Remark 2.1.12 It turns out that the class of all Pl-algebras has good struc- 
ture and combinatorial properties similar to those of the commutative and 
the finite dimensional algebras (see e.g. the books by Rowen [231] and Kemer 
[142]). We hope to illustrate this claim in the present book. 


Remark 2.1.13 Starting with the free Lie algebra D(X) we can define the 
notion of polynomial identity for a Lie algebra G. The basic properties of 
Lie algebras with polynomial identities are similar to those of associative PI- 
algebras (see e.g. the book by Bahturin [21]). In the sequel we shall give the 
basic definitions for associative algebras. Nevertheless, comparing the theory 
of Lie algebras with polynomial identities with that of associative Pl-algebras, 
there is a big difference. The deep structure and combinatorial properties of 
the associative algebras do not hold for Lie algebras. On the other hand, 
many problems are stated in the same way in both the associative and Lie 
cases and very often the same methods for investigation work. 


Exercise 2.1.14 (i) Let G be the two-dimensional Lie algebra with basis as 
a vector space {a,b} and multiplication [a,b] = a. Show that G satisfies the 
polynomial identity 
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[[e1, 2], (v3, v4]] = 0 (the metabelian identity). 


(ii) If G' is a finite dimensional Lie algebra and dimG' < n, then G satisfies 
the Lie standard identity of degree n+ 1 (but in n skew-symmetric variables) 


LoSp(adx,,...,ad%,) = S- (signa) (#0, @o(1),---;€o(n)] = 0. 
TES y 


(iii) Show that the Lie algebra (U;,(K))‘~? of all upper triangular n x n 
matrices satisfies the identity 


[[v1,@2],-.-,[@2n-1, 2n]] = 0. 


Exercise 2.1.15 Let W, be the set of all derivations of the polynomial 
algebra in n variables. (The linear operator 6 of the vector space A [a1,...,@n] 
is a derivation if d(uv) = d(u)u + ud(v), u,v © K[a1,..., &n].) 

(i) Show that W,, is a Lie algebra with respect to the operation [d,,d2] = 
6162 — b261. 

(ii) Show that W satisfies the Lie standard identity 


tos4(ada,, aday, adag, adx4) = 0. 


(iii) Show that W,, satisfies some Lie standard identity. 


Hint. (ii)-(iii) Every mapping {a1,...,%,} —> A[a1,...,%,] is uniquely 
extended to a derivation and this gives that the elements of W, are of the 
form 


So fle vend gg fi € K[ai,..., nl], 


where 0/0x; are the usual partial derivatives. For n = 1 check that 


; 0 0 0 0 0 
X isne) fale) fooled foaled gos fara(a) es Sainte] =0 
for fo,...,f4 © K[e]. For further hints see the book by Bahturin [21]. 


Exercise 2.1.16 Let the base field K be finite and let the Lie algebra G be 
finite dimensional. Prove that G' satisfies a polynomial identity of the form 


«f(ady) = 0. 


Remark 2.1.17 As in the case of associative algebras, some classical prop- 
erties and results for Lie algebras can be stated in the language of polynomial 
identities. Let G be a Lie algebra. 

(i) The algebra G is abelian if it satisfies the identity [%,, 22] = 0 (i.e. has 
a trivial multiplication). 
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(ii) The algebra G is nilpotent of class < n if it satisfies [e1,...,¢n4ai] = 
0. (Pay attention to the difference in the class of nilpotency of associative 
and Lie algebras and compare it with the class of nilpotency for groups. If 
one replaces the Lie commutators [u, v] = uv — vu with group commutators 
(u,v) = u7'v-tuv, the definitions are the same for groups and Lie algebras.) 

(iii) The algebra G is solvable of class < n if it satisfies the identity f, = 0, 
where fn = fn(@1,...,€an), is defined inductively by 


fi(v1, 2) = [x1, v9], 


fn = [fn—1("1, sey Lon-1), fn—1(@on-141, sey ton)|, n>. 
The solvable of class 2 Lie algebras are called metabelian. 

(iv) A version of the Lie theorem (see e.g. [87]) gives that over a field 
of characteristic 0 any solvable finite dimensional Lie algebra satisfies the 
identity 

[[aa, ro, sey [fon—1, Lon]] =0 


for some positive integer n. 


2.2 Varieties and Relatively Free Algebras 


Definition 2.2.1 Let {f;(1,...,an,) © K(X) | ¢ © I} be a set of poly- 
nomials in the free associative algebra K(X). The class U of all associative 
algebras satisfying the polynomial identities f; = 0,7 € J, is called the variety 
(of associative algebras) defined (or determined) by the system of polynomial 
identities {f; | i © I}. The variety W is called a subvariety of U if Wc W. 
The set T(%) of all polynomial identities satisfied by the variety VU is called 
the T-ideal or the verbal ideal of UW. We say, that the T-ideal T(M) is gener- 
ated as a T-ideal by the defining set of identities {f; | i © I} of the variety 
W. We use the notation T(W) = (f; | i € I)? and say that the set {f; | i € I} 
is a basis of the polynomial identities for G. The elements of T(%) are called 
consequences of the polynomial identities in the basis. If R is any algebra, we 
denote by T(R) the T-ideal of the polynomial identities of R. 


Exercise 2.2.2 Show that for any variety U its T-ideal T'(2) is a fully invari- 
ant ideal of K(X), i.e. T(M%) is invariant under all endomorphisms of K(X). 
(In the notion “T-ideal”, “T” is for “Transformation” (= “Endomorphism” ) 
and “Verbal” comes from group theory since the endomorphisms of the free 
group are defined by mapping the free generators to elements (words) of the 
free group.) Show that the same statement for fully invariance holds for the 
ideal T(3) N K(e1,...,2m) of K(a1,...,@m), where x1,...,@m © X. 


Example 2.2.3 The class of all commutative algebras is a variety defined by 
the identity [#1, 22] = 0. The class of all associative algebras is also a variety 
defined by the empty set of polynomial identities. 
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Definition 2.2.4 For a fixed set Y, the algebra Fy (M3) in the variety U is 
called a relatively free algebra of UW (or a U-free algebra), if Fy (%) is free in 
the class U (and is freely generated by Y). 


Now we shall see that the relatively free algebras exist and two algebras 
of the same rank are isomorphic. In the sequel we shall denote the relatively 
free algebra of rank m = |Y| by F,(%). If Y is a countable infinite set we 
use the notation F'(2%) instead of PF, (%). 


Proposition 2.2.5 Let G be a variety defined by {f; | i © I}, let Y be any 
set and let J be the ideal of K(Y) generated by 


{filgi,---,9ni) lay CE KY), 6 ETH. 


Then the algebra F = K(Y)/J is a relatively free algebra in U with set of 
free generators Y = {y+ J |y€Y}. Every two relatively free algebras of the 
same rank in GU are isomorphic. 


Proof. (i) First we shall see that F € U. Let f;(a1,...,@n) be one of the 
defining identities of U and let 91,...,g, be arbitrary elements of F', 9; = 
a3 t+J, 9; © K(Y). Then fi(gi,-.-,9n) € J, hence fi(G1,..-, Gn) = 0 and this 
means that f;(%1,...,@n) = 0 is a polynomial identity for F. Hence F € %. 

(ii) Now we shall prove the universal property of F. Let R be any algebra 
in GW and let 6: Y —> R be an arbitrary mapping. We define a mapping @ : 
Y — R by @(y) = ¢(y) and extend @ to a homomorphism (denoted also by @) 
6: K(Y) —> R. This is always possible because K(Y) is the free associative 
algebra. In order to prove that ¢ can be extended to a homomorphism F’ —> 
R, it is sufficient to show that J C Keré. Let f € J, Le. 


f= So wi fi(gar, 6s Giny)Viy ij, Ui, vi C K(Y). 
tel 

For arbitrary r1,...,7n, € R, the element f;(r1,...,7n,) is equal to zero in R 
and this implies that @(f) = 0, i.e. J C Keré and F = F5-(%) is the relatively 
free algebra in %, freely generated by Y. 

(iii) Let |Y| = |Z], Y = {y | ¢ © I}, 7 = {a | ¢ © TH, and let Fy (%) 
and Fz(%3) be the corresponding relatively free algebras. Since both algebras 
Fy (%) and F'z(%) are relatively free, we can define homomorphisms 


@ : Fy (2) —_ Fz(%), w : Fz(%) —_ Fy (2) 


by ¢(y:) = 2, d(z;) = y. Since the compositions dow and yo¢ act identically 
on Y and Z, respectively, we obtain that ¢ and y are isomorphisms. 


Remark 2.2.6 It follows from the proof of Proposition 2.2.5 that the T-ideal 
of K(X) generated by {f; | ¢ € I} consists of all linear combinations of 
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Ui Li (Git, - +5 Jing) Vir Gig, Ui, Vi, © K(X). 


Theorem 2.2.7 There exists a 1-1 correspondence 7 between the T-ideals 
of K(X) and the varieties of associative algebras; 7 is a “Galots correspon- 
dence”, t.e. for any two T-ideals V,, V2 the inclusion Vi C Vo is equivalent to 
m(Vi) >) m(V). 


Proof. For every T-ideal V we define % = a(V) to be the variety determined 
by the polynomial identities belonging to V. This correspondence in “onto” 
because every variety has a T-ideal. Let Vi #4 Vo and 2(V;) = Uj, ¢ = 1,2. 
Then there exists a polynomial f(a#1,...,@,) which is in Vj \Vo (or in Vo\ V4). 
Obviously, f(%1,...,@,) = 0 is a polynomial identity for GJ, and is not a 
polynomial identity for the relatively free algebra F(W2) = K(X)/Vo € Uo. 
Hence %, 4 Wy and z is 1-1. Clearly, UW, D We if and only if every polynomial 
identity for GW, is satisfied also by Wa, i.e. T(W1) C T(Ba). 


Remark 2.2.8 If U; C We, then T(%) D> T (M2) and we may consider the 
polynomial identities of Uy modulo T(Wzy). Hence if we know the polynomial 
identities of Wy and want to study the polynomial identities of 3, we may 
work in the relatively free algebra F(a). 


2.3 The Theorem of Birkhoff 


Definition 2.3.1 Let % be a class of algebras. We denote by CU, SW and 
ON, respectively, the classes obtained from 2% by taking all Cartesian sums 
(i.e. “direct sums” allowing infinite support), subalgebras and factor algebras 
of algebras in 3. 


Theorem of Birkhoff 2.3.2 A class of algebras U is a variety if and only 
if G is closed under taking Cartesian sums, subalgebras and factor algebras, 


Le. CU,SV,OVCM. 


Proof. (i) Let % be a variety and let R; € YU, 7 € J. The Cartesian sum 
R= ies R; consists of all “sequences” (r; | j € J), rj © Rj, with coordi- 
natewise defined operations. Let f(a1,...,@n) be a polynomial identity for 


Ber Yr) eR, = (rl | je), then 
P(r.) = (FO?!) [7 Ed). 


Clearly, all coordinates of f(r),...,r) are equal to 0 and R € WG. The 
proof of SY, OW C W is similar. 

(ii) Let CU, SU, OW C VG and let T(W) C K(X) be the set of polynomial 
identities satisfied by the algebras in U. We denote by 2% the variety defined 
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by the identities from T (2). Obviously, 8 C MW. We shall show the equality 
GJ = W. Let m be an arbitrary cardinality and let Y = {y; | i € I} be a set 
with m elements. Let N be the set of all elements f(a1,...,@,) of K(X) which 
are not polynomial identities for U. We present A(Y) as a disjoint union of 
two subsets T,, (J) and N,, in the following way. Let f(y,,...,4,,) be any el- 
ement of K(Y), where y;,,...,y;, are n different elements of Y (and n < m). 
If f(a1,...,an) € T(B) , then we assume that f(yi,,...,¥,) © Tm(%) and if 
f(#1,...,8n) is not a polynomial identity for %, i.e. f(ai,...,2n) € N, then 
Yirs---) Yi, © Nm. For every f = f(yi,,.--,4:,.) € Nm there exists an algebra 
Ry € GW and elements rj,;,...,7ri,¢ € Rp, such that f(ri,y,..., 71,7) #0 im 
Ry. For any i € 1,714 %,...,%n, we choose arbitrary elements rj, € Ry and 


define elements 
a =(ryp lie le So Ry. 
FENm 

Since CU,SU C W, we obtain that the algebra F' generated by z in 
yo teNm Ry belongs to %. On the other hand, if g(m,,...,yi,) € Nm, then 
g(Zi,---;2%i,) F 0, because g(riyg,---, Ying) F O for any g © Nm. Hence 
the kernel of the canonical homomorphism K(Y) —> F extending y; > ;, 
i € I, coincides with T,,(%) and F is isomorphic to F,,(2%), the relatively 
free algebra of rank m in 2. Finally, since O37 C W%, and every m-generated 
algebra in 20 is a homomorphism image of Fi, (25) which is in UW, we obtain 


that MCV, ie. V=W 


Definition 2.3.3 For a class of algebras ¥ or for an algebra R, we denote by 
var or by varR the variety of algebras defined by the polynomial identities 
from T(%) or T(R) and call this variety the variety generated by U or R. 
Instead of F,(varR), sometimes we shall denote the relatively free algebras 


of varR by F(R). 


The following corollary is a consequence of the proof of Theorem of 


Birkhoff 2.3.2. 
Corollary 2.3.4 [f U is a class of associative algebras, then varU = OSCR. 
Exercise 2.3.5 Let R be a finite algebra (i.e. |K| < co and dimR < oo). 


Show that varR is locally finite, i.e. every finitely generated algebra in varR 
is finite. Prove that |F,(varR)| < |R|I#I", m EN. 








Hint. Show that in the proof of Theorem of Birkhoff 2.3.2, in the construction 
of the relatively free algebra F, (var) it is sufficient to consider direct sums 
of |R|™ copies of R instead of Cartesian sums. 


Exercise 2.3.6 If A = F, is the field with ¢ elements, show that the poly- 
nomial identities 
[x,y] = 9, at —2=0 


26 2. Algebras with Polynomial Identities 
form a basis for the polynomial identities of the K-algebra Fyx. 


Hint. Use that for any 0 A f(#1,...,2m) € K[X]/(a? — « | & € X), 
there exist some a; € Fyn such that f(ai1,...,a@m) # 0. Hence the alge- 
bra K[X]/(#" —« | « € X) is relatively free in the variety defined by the 
identities [a, y] =0 and «?” — 2 =0. 


Exercise 2.3.7 Show that the polynomial identity «? — « = 0 implies the 
commutativity. 


Hint. Use that 





0=(a@+y)? —(@+y) = (#? — 2) + (y —y) + (wy t yx) = ey + yo 


and obtain the anticommutative law. For x = y it gives 227 = 0 and, since 
x? = x, we see that 22 = 0. Therefore x = —# and the anticommutativity is 


equivalent to the commutativity. 


Remark 2.3.8 It is known that the polynomial identity «” — « = 0 implies 
the commutativity, i.e. in Exercise 2.3.6 one needs only «?" — x = 0. See the 
book by Herstein [126] for the proof. 


Remark 2.3.9 In a similar way, as in Definition 2.2.1, starting, respectively, 
with the free Lie algebra and the free group, one can introduce polynomial 
identities and varieties of Lie algebras, identities and varieties of groups, etc. 
and to prove Theorem of Birkhoff 2.3.2 and Corollary 2.3.4. See the books 
by Bahturin [21] and Hanna Neumann [192] for details. 


3. The Specht Problem 


One of the reasons for introducing the notion of varieties of groups and al- 
gebras is that the varieties give some rough classification of all groups and 
algebras in the language of identities. Of course this classification is very 
rough. For example, as we shall see in Exercise 4.3.7, the only variety of 
commutative algebras over an infinite field is the variety of all commutative 
algebras. From this point of view, commutative algebra is “trivial”. Since we 
want to classify all varieties, the first question is whether we can do this in 
finite terms. In this chapter we consider the Specht problem, whether every 
variety of algebras can be defined by a finite system of polynomial identities. 
Together with a similar problem in group theory, this has been one of the 
main problems in the theory of varieties of groups and algebras for more 
than 30 years. Even now, it is still open for some classes of groups and al- 
gebras. Here we give an example of a variety of Lie algebras over a field of 
characteristic 2 which has no finite basis of its polynomial identities. 


3.1 The Finite Basis Property 


Definition 3.1.1 A variety of associative algebras % is called finitely based 
(or has a finite basis of its polynomial identities) if GW can be defined by a 
finite system of polynomial identities (from the free associative algebra K(X), 
X = {x1,a2,...}). If W cannot be defined by a finite system of identities, it 
is called infinitely based. If all subvarieties of U, including W itself, are finite 
based, U satisfies the Specht property. One defines similarly the finite/infinite 
basis and Specht properties for varieties of groups, Lie algebras, etc. 


The following problem was asked first for groups by B.H.Neumann in 
his thesis in 1935 (see also his paper [191] in 1937 and the book by Hanna 
Neumann [192] for references), then by Specht [245] in 1950 for associative 
algebras over a field of characteristic 0. Now it is known as the Specht problem. 


Specht Problem 3.1.2 Is every variety of associative or Lie algebras finitely 
based? 


The investigations on the Specht problem are in two directions: 
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(i) To show that some varieties satisfy the Specht property. 
(ii) To construct countereramples to the Specht problem, i.e. ecamples of 
varieties which have ho finite basis for their identities. 


The first significant result in direction (i) is due to Sheila Oates and 
Powell [197] who established that a variety generated by a finite group has 
the Specht property. Later, the method was extended to the case when the 
variety is generated by a finite ring with some reasonable conditions on the 
ring (e.g. for Lie rings, associative rings, etc.), see the book by Bahturin [21] 
for details. 


Theorem 3.1.3 Let R be a finite group, a finite associative or Lie ring, a 
finite dimensional associative or Lie algebra over a finite field. Then varR 
has the Specht property. 


Another group of results shows that the variety has the Specht property 
if it satisfies an identity of some special kind. Finally, in a series of papers, 
Kemer (see his book [142] for details) developed a powerful and complicated 
technique which allowed him to show that the T-ideals of the free associative 
algebras over a field of characteristic 0 have many properties of the ideals 
of the polynomial algebras with commuting variables. In particular, in 1987 
Kemer gave a positive solution of the Specht problem for associative algebras 
over a field of characteristic 0. 


Theorem 3.1.4 (Kemer) Every variety of associative algebras over a field 
of characteristic 0 has a finite basis for its polynomial identities. 


Concerning (ii), the first counterexamples were given in group theory by 
Olshanskii [201] in 1970 and then by Adyan [5] and Vaughan-Lee [258], also 
in 1970. Maybe the simplest example is due to Kleiman [149] and Bryant [39] 
who showed that the system of group identities 


(a7 ...e2)*=1,n=1,2,..., 


does not follow from any of tts finite subsystems. This system of identities 
defines the variety which is usually denoted by 8482 and consists of all 
groups G' such that the normal subgroup H of G generated by all squares is 
of exponent 4. 

For Lie algebras the first counterexample was given by Vaughan-Lee [259] 
in 1970 for Lie algebras over a field of characteristic 2 and then in 1974 
by Drensky [69] and Kleiman (unpublished) for Lie algebras over a field of 
any positive characteristic. It was also shown in the papers by Vaughan-Lee 
and Drensky that Theorem 3.1.3 cannot be generalized to the case of finite 
dimensional Lie algebras over an infinite field of positive characteristic. In 
particular, the result of Vaughan-Lee [259] (as precised in the Ph.D. thesis 
of Drensky [70] for (ii)) is the following. 


3.2 Lie Algebras in Characteristic 2 29 


Theorem 3.1.5 Let K be a field of characteristic 2. 
(i) Let be the variety of Lie algebras defined by the polynomial identities 


[[[x1, v2], [vs, e4]], v5] = 0 (the centre-by-metabelian identity), 


[[wi, @2,@3,...,€n], [a1, e2]] = 0, n = 3,4,... 


Then GU has no finite basis of its polynomial identities. 

(ii) Let K be infinite and let My be the variety of Lie algebras generated 
by the Lie algebra M,(K)”? of 2 x 2 matrices with entries from K. Then 
Mo has no finite basis for its identities and the polynomial identities in (1) 
together with the polynomial identities 


[[z1,%4,25,...,%n],[2, v3] + [[e2, t4,25,...,@n], (23, e1]]+ 


[[ws, @4,05,-.-,2n],[e1, e2]] =0, n= 4,5,..., 


form an infinite basis of the identities for Me. 


Another example of a variety of Lie algebras over a field of characteristic 
2 without finite basis of its polynomial identities was given also by Vaughan- 
Lee [260] in 1975. His proof is based on work in the universal enveloping 
algebra of the free nilpotent of class 2 Lie algebra. We forward to the book 
by Bahturin [21] for a further survey on the Specht problem for Lie algebras. 

Concerning finite nonassociative algebras, Polin [207] constructed a simple 
example showing that Theorem 3.1.3 does not hold for arbitrary finite rings 
and algebras. 

Up till now, the Specht problem is still open for associative algebras over a 
field of positive characteristic and for Lie algebras over a field of characteristic 


0. 


3.2 Lie Algebras in Characteristic 2 


The main purpose of this section is to give a proof of Theorem 3.1.5 (i). We 
start with some constructions of Lie algebras. 


Exercise 3.2.1 Let G be a Lie algebra and let D(G) be the vector space of 
all derivations of G, 1.e. linear operators of the vector space G which satisfy 


d([u, v]) = [6(u), v] + [u, 6(v)], 6 € D(G), u,v EG. 


(i) Show that D(G) is a subalgebra of the algebra (EndxG)‘~), the Lie 
algebra of all linear operators of G. (Compare with Exercise 2.1.15 (i).) 

(ii) Let D C D(G) be a Lie algebra of derivations of G and let the algebra 
HT be defined by H = G6 D as a vector space, with multiplication given by 


[91 +41, 92 +62] = ([91, 92] +61 (92) —92(91)) +[61, 52], gi € G6; € D, 7 = 1,2. 
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Show that H is a Lie algebra and G is an ideal of H. 


Exercise 3.2.2 Let G be a Lie algebra and u € G. Show that 
adu:v > [v,u], v EG, 


is a derivation of G. Such a derivation is called inner. 


Till the end of the proof of our main result Theorem 3.1.5 (i) we assume 
that chark = 2. Therefore, for any Lie algebra G, we obtain that g = —g 


and [92,91] = [91, 92], 9,.91,92 © G. 
Lemma 3.2.3 Letn €N be fired and let 
Cn = K[to,t1,.--,tn]/(tB,8,..., 08). 
Let Gy, be the algebra with basis as a vector space 
{u(to, ti,...,tn)a, t2t, ...t,b | ua monomial with coefficient 1 in Cy}. 


Assuming that uju2b = 0 af uue F tty ...tn, we define the multiplication 
between the basis elements of Gy by 


[ui a, ua] = uzugd, 
and all other products of the basis elements are 0. Then Gy, is a Lie algebra 


which is nilpotent of class 2 (t.e. satisfies the identity [x1, #2, £3] = 0). 


Proof. Since the product of any three elements of the algebra G,, is equal 
to 0, we obtain that G', satisfies the identity [a1,a2,v3] = 0 and the Jacobi 
identity. We shall complete the proof if we establish the anticommutativity, 
which is equivalent to show that 


[uia, uea] = —[uoa, ural, [ua, ua] = 0 


for any monomials wu 1, U2, u € C,. Since charK = 2, the definition of the mul- 
tiplication between u a and wea gives the first equality. The second equality 
[ua, ua] = 0 is also obvious because n > 0 and u? F 26)... tn. 


Lemma 3.2.4 In the notation of Lemma 3.2.3, let do, d,,..., dy, be the linear 
operators of G', defined on the basis vectors of Gy, by 


d;(ua) = tjua, d;(ub) =0,i=0,1,...,n. 


Then the vector space D,, = span{do, d1,...,dn} spanned by do, di,...,dy ts 
an abelian (i.e. commutative) Lie algebra of derivations of Gy. 


Proof. Clearly, d;,d;, 7,7 = 0,1,...,n, are commuting linear operators of 
G,. It is sufficient to see that each d; acts as a derivation on the products of 
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basis elements of G,. The only nontrivial case is d;([u ia, wea]) (why?). Direct 
calculations show 


d;([uia, uea]) = d;(u1u2b) = 0, 
[d;(u a), wea] + [uia, dj(uga)] = (tp )ue)b + (ur (t;u2))b = 0, 
d;([uia, uea]) = [d; (ura), wea] + [wia, dj(uga)] = 0, 
and this means that d; is a derivation. Since [d;, d;] = 0, the vector space D, 


is an abelian Lie algebra. 


Lemma 3.2.5 In the notation of the previous two lemmas, let H, = Gpn®PDp 
be the Lie algebra obtained as in Exercise 3.2.1 (ii). Then: 
(i) The algebra Hy, satisfies the central-by-metabelian polynomial identity 


[[21, xo], [@3, tal, £5] = 0. 


(ii) The centre of Hy, (i.e. the set C(H,,) of all elements z © Hy such that 
[Hn, 2] = 0) coincides with the vector space spanned on t2t, ...tnb. 


Proof. (i) One can directly see that the commutator ideal H!, = [Hy, Hp] is 
contained in G, and Gi, = Kept, ...t,6. This implies that 


Hi = (HL) C Gi, = Ktotr...tyb- 


The definition of the multiplication in H,, shows that the element tty Lot, b 
is in the centre of H,,. This gives that [[Hn, Hn], (Hn, Hn], Hn] = 0, ie. Hn is 
centre-by-metabelian. We leave part (ii) of the lemma for exercise. 


Proposition 3.2.6 The algebra H, satisfies the polynomial identities 
Fie (a1, oe ., tk) = [[v1, £2, &3, a tl, [%1, £2] = 0, k > 3,h # n+ 2, 


and fr4o(%1,.--,;€n42) does not vanish on Hy. 


Proof. In order to show that fr+o a 0 in Hy, we replace the variables by 
f, = a, fo = do, &} = dj_2 fori = 3,...,n +2. Direct calculations show that 








Fnoo(€1,---,®n4e) =toti...tnbd £0, 


and frse # 0in H,. Now, let & 4 n+2 and let us assume that f,(#1,...,2%) F 
0 for some #; € H,. Since fx(#1,...,£x) is linear in each of the variables Z;, 
t= 3,...,k, without loss of generality we may assume that z;, i = 3,...,k, 
are some of the basis elements of H,. Additionally, we may assume that 


[@4, £9] =cqat+ cob, a, be Ch, 


is some element of G,,. It is easy to see that the only elements which give 
a nonzero contribution to f,(%1,...,2,) are [%1, £2] = ca, where c € Cy, is 
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a polynomial without constant term, %; = d;,, 7 = 3,...,&. Hence, we may 
assume that 


fr(#1,.--,€n) = |ca,d;,,...,d;,, ca] = ety, ...t;,b F# 0. 


Let ¢ = }>a;u;, where u; are monomials of C, and a; € K. The associative 
algebra C,, is commutative and chark = 2. Hence 


f= (DY laju)’ = di ajuy, 


and this means that it is sufficient to assume that [%,, £2] = ua for some 
monomial u of degree > 1 in C,,. Now 


fr(Z1, .. ., &h) = uti, .. .U;,6 # 0. 
The nonzero elements fb € H,, are multiple of t2t, ...t,60 and we obtain that 


urls, «ti, = ly. .tn. 


ig e+ 


In this way we obtain that u = to (since degu > 0) and k = n+ 2, which is 
impossible. Therefore, f,(@1,...,%) =O in Hp. 


Theorem 3.2.7 (Vaughan-Lee [259]) Let K be a field of characteristic 2 and 
let UW be the variety of Lie algebras defined by the polynomial identities 


e(#1, €2,%3,04, 5) = ([[z1, xo], [v3, xa], a5] = 0, 


fn = [[v1, 2, %3,---,€n], [@1, £2]] = 0, n = 3,4,... 


Then GU has no finite basis of its polynomial identities. 


Proof. Let U be defined by a finite system of polynomial identities 
9i(%1,-.-,%m) =0,¢=1,2,...,k. 


All g; belong to the T-ideal V = T(%3) generated in the free Lie algebra L(X) 
by 


(41, €2,%3,€4,%5), fn(41,..-,8n), n= 3,4,..., 


and V is generated by g;, 7 = 1,...,&. Clearly, each g; depends on a finite 
number of f,,’s and, since the number of the g;’s is finite, we obtain that as a 
T-ideal V is generated by ¢(#1, ¥2, #3, £4, 5) and a finite number of f,,’s, say 
fs,..., fx. Hence, every centre-by-metabelian Lie algebra H which satisfies 
the identities f, = 0, n < k, satisfies also all other identities f, = 0, n> k. 
This contradicts with Proposition 3.2.6 and completes the proof. 


The existence of an infinitely based variety allows to construct different 
examples. 
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Corollary 3.2.8 Over a field of characteristic 2, there exist continually many 
varieties of Lie algebras. 


Proof. Let S be any subset of N \ {1,2}. We define a variety Ws of centre- 
by-metabelian Lie algebras (i.e. we assume that Uy satisfies 


[[z1, v2], [vs, v4], x5] = 0) 
by the polynomial identities 
fr(%1,-.-, ne) = [[1, 2, %3,..., en], [€1, %2]] =0,k ES. 


Let S # S’ and let, for example, k € S \ S’. Proposition 3.2.6 gives that 
the algebra Hz_2 does not satisfy the identity f, = 0. Hence Hx_2 does 
not belong to %y. From the other hand, Hz_2 satisfies all other polynomial 
identities f, = 0,n = 3,4,..., hence H, € Us. In this way all varieties Us 
are different. Since the subsets of N \ {1,2} are continually many, we obtain 
the statement of the corollary. 


Corollary 3.2.9 Over a field of characteristic 2, there exist continually many 
varieties of Lie algebras which form a chain with respect to the inclusion. 


Proof. We use a trick of Olshanskii [202]. Let Q = {ri,ro,...} be the set 
of the rationals. For any a € R we construct a variety M., consisting of all 
centre-by-metabelian algebras satisfying the polynomial identities 


fi (#1, oe th) = [[z1, %2, 3, a ., &k], [x1, #2] = 0, Tk > Qa. 


As in Corollary 3.2.8, we see that if a < @, then 8, is a proper subvariety 
of Wy (because Wi, satisfies more identities than WW). 


Exercise 3.2.10 Show that over any field the free centre-by-metabelian Lie 
algebra (sometimes one uses the notation F'((2, €]) = L/[L", L], where L = 
E(X)) is spanned as a vector space by 


[Ti,, Viaser ey Lind; lz 51, Usayrres xj,J; [Th kel], 


where ty > tg S13 <<... tm, > jo < 93 <<... < In, ki > ka, m= 1,2,..., 
n= 2,3,... 


Hint. See the paper by Vaughan-Lee [259] or the book by Bahturin [21]. 
Another possibility is to try to solve the exercise after reading Chapter 5. 


Exercise 3.2.11 In the case of characteristic 2, show that there exists an 
m €N such that the algebras H,, defined in Lemma 3.2.5 satisfy the Engel 
identity 
to(ad” x1) = [vo,a1,..., 81] = 0. 
— 


m times 
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Hint. Use that in H, the only nonzero contribution in 
([A1, ha|(ad*h), hs], h, hy, ho, hg € Ay, 
comes from A € D,, and use that 


ad?(h! + hl’) = ad?h! +. ad?h", h!,h! € Dn. 


Exercise 3.2.12 Derive from Exercises 3.2.10 and 3.2.11 that the varieties 
in Corollaries 3.2.8 and 3.2.9 can be chosen locally nilpotent (i.e. their finitely 
generated algebras are nilpotent) and hence locally finite dimensional. 


Hint. See the book by Bahturin [21] or the paper of Drensky [73] for further 
comments. 


Exercise 3.2.13 Let chark = 2 and let U = U(F(%2)) be the universal 
enveloping algebra of the free nilpotent of class 2 Lie algebra F(z). Let J 
be the minimal ideal of U containing the elements 


fa = [x1, 2][x2, x3] oa -[2n-1, €n][tn, 21] € U, n= 2,3, sey 


and such that J is closed under all linear transformations, i.e. u(a1,...,@n) € 
J implies that u(}7 aja;,..., > ainai) € J for any aj € K. Show that J 
is not finitely generated as an ideal invariant under linear transformations. 
Derive from here, that the variety of Lie algebras over a field of characteristic 
2 defined by the polynomial identities 


[[z1, v2, a3), [r4, U5, x6] = 0, 
(this identity defines the variety AN» of abelian-by-nilpotent of class two Lie 
algebras) and 
In = [ly, ¥2, Ys], [21, 22], [@2, v3], ---,[%n-1, En], (tn, ei]] = 0, n = 2,3,..., 


is not finitely based. 


Hint. This is the result of Vaughan-Lee [260] mentioned above. See the orig- 
inal paper for help. 


Remark 3.2.14 If chark = 0, the previous exercise is not true. In this case 
Volichenko [264] showed that the ideals of U = U(F(Sty)) invariant under 
linear transformations are finitely generated in the class of all such ideals. It 
was one of the main steps of his proof that every subvariety of the variety 


AM is finitely based. 


Problem 3.2.15 Let chark = p > 0. Are the subvarieties of 29%. finitely 
based? Are the subvarieties of 20, finitely based? (The variety 291, is defined 
by 
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[[a1, TQ +025 tpi), [vi Y2,-- +, Yp+ill = 0.) 


Very probably the answer to the second question is negative. 


Problem 3.2.16 Let KK be an infinite field of odd characteristic p. Does the 
Lie algebra M,(K)O have a finite basis for its polynomial identities? Very 
probably the answer is negative. 


Problem 3.2.17 Let K be an infinite field of characteristic 2. Does the 
associative algebra Ms(K) have a finite basis for its polynomial identities? 
This algebra is considered as the most promising candidate for the negative 
solution of the Specht problem for associative algebras over a field of positive 
characteristic. No guess whether the answer is positive or negative. 


Exercise 3.2.18 Let V2 be the 2-dimensional vector space with a fixed ba- 
sis €1,€2 and let Us(K) be the subalgebra of upper triangular matrices in 
M.2(K). Assuming that U2(A) and Mo(K) act on V2 from the right, define 
nonassociative algebras 


Ry =We Us(K), Ro =VWo® M2(K) 
as vector spaces and with multiplication 
(vy + a1)(v2 + a2) = U1 42, UE Vo, aj E Mo(K), i= 1,2. 


Show that R; and Re are left nilpotent, i.e. x1 (#243) = 0 and have no finite 
basis for their polynomial identities. 


Hint. The algebra Ry is the example of Polin [207] and the proof for the 
infinite basis property for R, is similar, based on the fact that the ideal of 
the polynomial identities of U2(K) is not finitely generated as a left ideal 
invariant under linear transformations. See the paper by Polin [207] for more 


help. 


4. Numerical Invariants of T-Ideals 


This chapter is devoted to the reduction of arbitrary polynomial identities 
to polynomial identities of special form: homogeneous, multilinear and the 
so called proper polynomial identities. We also introduce the most important 
numerical invariants of T-ideals: the codimension sequence of the T-ideal 
and the Hilbert series of the corresponding relatively free algebras. We find 
various relations between the ordinary and the proper numerical invariants. 
The considered reductions turn to be very useful both for studying the general 
properties of PI-algebras and in the investigation of the polynomial identities 
of concrete important algebras. 


4.1 Graded Vector Spaces 


Definition 4.1.1 The vector space V is graded if it is a direct sum of its 
subspaces V("), n > 0, Le. 


veQvm =v. 


n>o n>o 


The subspaces V‘") are called the homogeneous components of degree n of 
V. If no misunderstanding, we shall write 5~> instead of s7°. Similarly, we 
introduce a multigrading on V if 


y= 3 S- YP Rm) 


i=1n,>0 


and call V("1»-9"™) its homogeneous component of degree (n1,...,%m). The 
subspace W of the graded vector space V = }7,, V) is a graded or (ho- 
mogeneous) subspace if W = >> (Wa V)). In this case, the factor space 
V/W can also be naturally graded (and we say that V/W inherits the grading 
of V). 


Example 4.1.2 (i) The polynomial algebra A[x1,...,#m] is graded assum- 
ing that the homogeneous polynomials of degree n (in the usual sense) 
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are the homogeneous elements of degree n. Similarly, K[a1,...,@m] has a 
multigrading, counting the entry of each variable in the monomials. Analo- 
gously, one can define grading and multigrading on the free associative al- 
gebra K(a1,...,%m) and a grading on K(X). Usually we shall assume that 
K[@i,...,€m] and K(x1,...,@m) are equipped with these two gradings. 

(ii) The Grassmann algebra has a grading inherited from the grading of 
K(X). 


Definition 4.1.3 Let V = S7.y, V‘) be a graded vector space and let 


dimV‘) < oo. The formal power series 


H(V,t) = Hilb(V,t) = So dimv™” 
n>o 


is called the Hilbert (or Poincaré) series of V. For a function f(t), we make 
the usual convention that H(V,t) = f(t) if the series H(V,t) converges in 
some neighbourhood of 0 and the functions H(V,t) and f(t) are equal there. 
Similarly, if the vector space 


Ve > Va me) n-= (n1, Leeg Mm), 


is multigraded, then the Hilbert series of V is 


H(V,t1,-..5tm) = Hilb(V, t1,...,tm) = So dimV ror 4 the 


Exercise 4.1.4 If V = > V) and W = 3. W) are graded vector spaces 
with the same (multi)grading, show that V @ W is also graded assuming that 
its homogeneous components are 


(VewM= SO vow, 
nitnii=n 
Exercise 4.1.5 Considering graded vector spaces, prove that 
Hilb(V @ W,t) = Hilb(V,t) + Milb(W, 2), 
Hilb(V @ W,t) = Milb(V, t)Hilb(W, ¢), 
Hilb(V, t) = Milb(V/W, t) + Hilb(W,t), W CV. 


Exercise 4.1.6 Prove that 


1 
1-¢t;’ 





1 m 
Hilb(K[«],t) = — Hilb(K[e1,.-.,%m],t1,-.-,4m) =] 
t=1 
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1 
~ L= (ti +...+tm) 
1 


~ Tmt’ 


Hilb(K (x1, oe 52m), t1, oe .stm) 


3 





Hilb(K (a1,...,%m),#) 


Exercise 4.1.7 Consider the Grassmann algebra E'(V;,) of the m-dimens- 
ional vector space V,, with basis {e1,...é¢m } with the natural (multi)grading 
counting the entries of e; in the monomials e;,...e¢;, (as in Exercise 4.1.2 


(ii)). Show that 


Hilb(B (Vn), t1,---.tm) = > ej(ti,--.,tm) = [JU +4), 
j=o i=l 
where e;(t1,...,¢m) 1s the j-th elementary symmetric polynomial. 


4.2 Homogeneous and Multilinear Polynomial 
Identities 


Now we go back to Pl-algebras. Recall that the polynomial identity g = 0 is 
a consequence of (or follows from) the polynomial identities f; = 0, 7 € J, if 
9 € (fi |i € I)?, the T-ideal generated by f;, i € I. 


Definition 4.2.1 Two sets of polynomial identities are equivalent if they 
generate the same T-ideal. 


Definition 4.2.2 A polynomial f(x1,...,2,) in the free associative algebra 
K(X) (or in its graded subalgebras and factor algebras) is multilinear of 
degree n if f is multihomogeneous of degree (1,...,1) in K(aq,...,@n) C 
K(X). We denote by P, the vector space of all polynomials in K(X) which 
are multilinear of degree n. (Very often one uses V,, instead of P,,, but we have 
reserved V,, for the n-dimensional vector space.) Clearly, P, is of dimension 
n! and has a basis 


{®o(1) ++ g(n) | CE Sn}. 
Proposition 4.2.3 Let 
i=0 


where f; is the homogeneous component of f of degree i in x1. 
(i) If the base field K contains more that n elements (e.g. K is infinite), 
then the polynomial identities f, = 0,1 =0,1,...,n, follow from f =0. 
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(ii) If the base field is of characteristic 0 (or if charK > degf), then f =0 


ts equivalent to a set of multilinear polynomial identities. 


Proof. (i) Let V = (f)? be the T-ideal of K(X) generated by f. We choose 


n-+ 1 different elements a9,a1,...,@, of kK. Since V is a T-ideal, 
n 
flajx1,@2,...,&m) = So oj fi(wi,a2,...,2m) EV,7=—0,1,...,n. 
i=0 


We consider these equations as a linear system with unknowns f;, 7 = 
0,1,...,. Since its determinant 


1 ap az... a? 

1 a, a? ay 

1 2 = [Iti ai) 
a, a5 as i<j 

1 an az ar 








is the Vandermonde determinant and is different from 0, we obtain that each 
fi(t1,...,€m) also belongs to V, i.e. the polynomial identities f; = 0 are 
consequences of f = 0. 

(ii) We use the process of linearization. By (i) we may assume that 
f(#1,...,%m) is homogeneous in each of its variables. Let deg, f = d. We 
write f(y. + yo, @2,...,%m) € V in the form 


d 
fim + Yo, %2,..., 8m) = So fila, yas @2,---52m); 
t=0 


where f; is the homogeneous component of degree i in y,. Hence f; € V, 
t~=0,1,...,d. Since deg, fi <d,i=1,...,d—-1, 7 = 1,2, we apply inductive 
arguments and obtain a set of multilinear consequences of f = 0. In order to 
see that these multilinear identities are equivalent to f = 0, it is sufficient to 
see that F 

Fi(Yr, Yt, %2;--+,8m) = (<) fossa. ++; 8m); 


2 


and the binomial coefficient is different from 0 because char = 0 orchark = 
p>d. 


Exercise 4.2.4 Let chark = 0. Find a system of multilinear polynomial 
identities which is equivalent to each of the polynomial identities: 
ELL L + 2x, eoxi + xox x3 = 0; 


2 pe . pt . 
ry =0; 27 =0; x} =0; 
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[ay, ay]? = 0. 


Exercise 4.2.5 Show that any Pl-algebra (over any field) satisfies a multi- 
linear polynomial identity. 


Hint. Starting with an arbitrary polynomial identity f(#1,...,%m) = 0, use 
that the degree in y, and the degree in yz of the consequence of f = 0 


GY, Y2,€2,---,em) = fyi + ye, 2,...,€a)— 


—f(¥1,2,...,8m) — F(yo, €a,..-,€m) 


are smaller than the degree of f(#1,...,#m) in a and if deg, f > 1, then 
deg, ,9 >0,7 = 1,2. 


In the case of characteristic 0 every T-ideal is generated by its multilinear 
polynomial identities. Traditionally, many of the results on Pl-algebras are 
also stated in the language of multilinear identities. 


Definition 4.2.6 Let R be a Pl-algebra with T-ideal TR). The dimension 
of the multilinear polynomials in K(X) modulo the polynomial identities of 
R is called the n-th codimension of the T-ideal T(R) or of the polynomial 
identities of R and is denoted by c,(R) (or by cy (%) if we consider the 
T-ideal of the polynomial identities of a variety ¥), i.e. 


én(R) = dimP,/(P,pNT(R)), n= 0,1,2,... 
This sequence is called the codimension sequence of the T-ideal T(.R). We 
also consider the codimension series and the exponential codimension series 
t”? 


c(R,t) = So en(B)t”, (Rt) = S> en(R) = 


nt 
n>o n>o 


Convention 4.2.7 If R is an algebra (or U is a variety of algebras), for any 
subset S of K(X) we denote by S(R) (or by S(%)) the image of S under the 


canonical homomorphisms 
K(X) — F(R) = F(varR) = K(X)/T(R), K(X) — F(%). 
For example, the formula for the codimension sequence of T(R) is 
én(R) = dimP,,(R), n = 0,1,2,... 
Definition 4.2.8 Let the T-ideal T(R) be a (multi)homogeneous ideal of 


K(X). (For example this holds if the base field K is infinite.) Then we denote 
by 
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H (F(R), t1,-..,tm) = Hilb(Pn(R),t1,...5tm) 


the Hilbert series of the relatively free algebra of rank m in varR. If we are 
interested in the Hilbert series in one variable, we write 


(F(R), t) = Hilb,, (varR,t) = Hy, (varR,t) => dimF” 
n>o 


4.3 Proper Polynomial Identities 


The considerations in Section 4.2 hold for any class of algebras with the 
corresponding restrictions on the base field K. To the best of our knowledge, 
they were stated for the first time in the papers by A.I.Malcev [180] and 
Specht [245] in 1950. In what follows we shall use essentially that we work 
with unitary associative algebras. This investigation was started by Specht 
[245] in 1950 and in the quantitative way as here by Drensky [74], see also 
the expository paper [77]. The approach of [74] is based on an idea used 
by Volichenko [262] to describe the algebra K(X)/T(R) in the special case, 
when charAk = 0 and the T-ideal TR) is generated by [a1, #2, #3, xa]. 


Definition 4.3.1 A polynomial f € K(X) is called a proper (or commutator) 
polynomial, if it is a linear combination of products of commutators 


f(@1,..-, 8m =Soai,.., jl@irs 5) Vip) [ty @ jy], OG,..7 © A. 


(We assume that | is a product of an empty set of commutators.) We denote 
by B the set of all proper polynomials in K(X), 


Bm = BO K(a1,...,tm), m=1,2,..., Py =BOP,, n=0,1,2,..., 


1.e. By, 1s the set of the proper polynomials in m variables and Jj, is the set 
of all proper multilinear polynomials of degree n. If R is a Pl-algebra, by our 
Convention 4.2.7, we denote by B(R), By, (R) and I,(R) the images in F(R) 


of the corresponding vector subspaces of A(X). 


Exercise 4.3.2 Let B') be the homogeneous component of degree n of B. 
Show that BO = K, BO =0, and B®) and B®) have respectively bases 


Proposition 4.3.3 (i) Let us choose an ordered basis of the free Lie algebra 


L(X) 


©1,U2,.. -5 (iy, Cig], [@ 15 € jo], sey [Thy Cho, &ks], tty 
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consisting of the variables %1,x%2,... and some commutators, such that the 
variables precede the commutators. Then the vector space K(X) has a basis 


eft... 2@[x;,,2i,]°...[ei,,---,21,]° 
where @1,...,4m,6,...,¢ > 0 and [#;,, 44.) <...< [a1,,...,a,] tn the order- 
ing of the basis of L(X). The basis elements of K(X) with a, = ...= am = 0 


form a basis for the vector space B of the proper polynomials. 

(ii) If R is a unitary Pl-algebra over an infinite field K, then all polyno- 
mial identities of R follow from the proper ones (i.e. from those in T(R)NB). 
Ifchark = 0, then the polynomial identities of R follow from the proper mul- 
tilinear identities (t.e. from those in T(R)OTy, n = 2,3,...). 


Proof. (i) The first statement about the basis of K(X) follows from Witt The- 
orem 1.3.5 (that the free associative algebra K(X) is the universal enveloping 
algebra of the free Lie algebra L(X)) and from Poincaré-Birkhoff-Witt Theo- 
rem 1.3.2 which gives the basis of U(G) for any Lie algebra G. The statement 
about the basis of B also follows from Poincaré-Birkhoff- Witt Theorem. If 
we express the product of commutators 


[ips ey Pin] es [ty & yy] 


as a linear combination of the basis elements of K(X), the key point is that 
for any two consecutive commutators (both from the basis of L(X)) which 
are in a “wrong” order, e.g. 


 [@b,,---5 Co] [@ar,-- +5 Lan] +s [oy5---, 26] > [tas -- +, Can]; 
we replace the product 
[@o.5---5 bh] [@ar,--- Cag] 
by the sum 
[Wars s+ +> Cag] [@or,---5 Ce] + [[@6,5---5 Cel; [ars --) Lael]. 


Since the second summand belongs to L(X) and is a linear combination of 
commutators from the basis of L(X), we can apply inductive arguments and 
see that the elements of B are linear combinations of 


[ais, a;,]° sae [au sey ri), 


as required. 
(ii) Let f(a1,...,%m) = 0 be a polynomial identity for R. We may assume 
that f is homogeneous in each of its variables. We write f in the form 


f= Saat? ...28"walt1,.-.,&m), aq € K, 


where We(#1,...,%m) is a linear combination of 
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[ais, x;,]° wee [au sey a1]. 
Clearly, if we replace by 1 a variable in a commutator [#;,,...,#;,], the 
commutator vanishes. Since f(1 + #1, %2,...,%m) = 0 is also a polynomial 


identity for R, we obtain that 


ay a 
f(l4+41,82,...,€m) = So aa >> ( 2) abe 1 tS wWe(t1,...,em) € T(R). 
7=0 


The homogeneous component of minimal degree with respect to #1 is obtained 
from the summands with a, maximal among those with a, # 0. Since the 
T-ideal T(R) is homogeneous, we obtain that 


S- gh? ... 22" wa(@1,--.;&m) € T(R). 


a, max 


Multiplying from the left this polynomial identity by x{! and subtracting 


the product from f(#1,...,%m), we obtain an identity which is similar to 
f(€1,...,%m) but involving lower values of a. By induction we establish 
that 


S- ghs? ... 24" wa(@1,---;&m) € T(R), 
ay fixed 


Wa(%1,---, em) € T(R), 


and this completes the proof. The “multilinear” part of the statement is also 
clear. Starting with any multilinear polynomial identity for R, and doing 
exactly the same procedure as above, we obtain that the identity follows 
from some proper identities which are also multilinear. 


Exercise 4.3.4 Express the polynomial 
eoryag + 3x3tory — €%3hon, € K(X) 


as a linear combination of the basis vectors in Proposition 4.3.3. 


Exercise 4.3.5 Let chark = 0. Find a system of proper multilinear polyno- 
mial identities, equivalent to the polynomial identity 


eoryag _ a3tou4 — 81 %3heN, +21, Let3r, = 0. 


Exercise 4.3.6 Let chark = 0. Show that a polynomial f(a1,...,%m) € 
K(X) is proper if and only if the formal partial derivatives Of /0x; are equal 
to 0 fori =1,...,m. Here 0/02; is the derivation of K(X) defined by 


Ou; 


an d;; (the Kronecker symbol). 
aj 
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In virtue of this property, sometimes one calls the elements of I, multilinear 
constants. 


Exercise 4.3.7 Let R be any commutative algebra over an infinite field kK. 
Show that all polynomial identities of R follow from [a 1, x2] = 0. 


Exercise 4.3.8 Let L(X) be the free Lie algebra considered as a subalgebra 
of K(xyO) and let Phy = Py, 1 E(X) be the set of the multilinear Lie 
polynomials. Prove that for n > 1, the vector space PL, has a basis 


{[tn, Vo(1)s oe +) ®o(n—1)] | cE Sn—1}- 


Hint. Using the Jacobi identity and the anticommutative law, show that 


these elements span the vector space PL,. Write [%n,®o(1),---,€o(n—1)] 
as a linear combination of monomials 2,(1)...2%7(n), T © Sn. Show that 
@n€o(1)--+€e(n—1) is the leading term of [#,, %o(1),---,®o(n—1)] with respect 


to the lexicographic ordering in K(X). 
The basis of 7, given in the next theorem is called the Specht basis. 


Theorem 4.3.9 A basis of the vector space I, of all proper multilinear poly- 
nomials of degree n > 2 consists of the following products of commutators 


[vi,, o i, ] a [Pj os 2%], 
where: 
(i) All products are multilinear in the variables x1,...,&n; 
(ii) Each factor [tp,,@p,,...,p,] is a left normed commutator of length 


> 2 and the maximal index is in the first position, t.e. py > pe,...,Ps; 

(iii) In each product the shorter commutators precede the longer, i.e. in 
the beginning of the statement of the theorem k <...< 1; 

(iv) If two consecutive factors are commutators of equal length, then the 
first variable of the first commutator is smaller that the first variable in the 
second one, 1.€. 


6 [@p15Lpos ees Vp] [Pars Faas ++ Fas] + 


satisfies py < qy. 


Proof. We choose a basis of D(X) as in Proposition 4.3.3 (i) assuming that in 
the ordering of the basis the shorter commutators are before the longer ones 
and if two commutators are of the same length, this with a smaller first vari- 
able is before the other. Now the theorem follows directly from Proposition 


4.3.3 (i) and Exercise 4.3.8. 
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Definition 4.3.10 Let R be a (unitary) Pl-algebra over a field of charac- 
teristic 0. By analogy with Definition 4.2.6 we introduce the vector subspace 
Tyn(R) = Tn/ (Pn NT(R)), the proper codimension sequence 


Yn(R) = diml,(R), n =0,1,2,..., 
and the proper codimension series and exponential proper codimension series 


7(R,t) = Son (R)t", (Rt) = SS nS 


n>o n>o 


Now we give some quantitative relations between the ordinary and the 
proper polynomial identities. 


Theorem 4.3.11 Let R be a unitary Pl-algebra over an infinite field K. 
(i) Let 
{w;(#1,-..,%m) | j= 1,2,...} 


be a basis of the vector space By,(R) of the proper polynomials in the relatively 
free algebra F(R) of rank m, i.e. 


Bm(R) = (K(a,...,@m)OB)/(T(R) A K(a1,...,%m) NB). 
Then Fy,(R) has a basis 
{eq ...an"wj(@1,-..,8m) | ai > 0,7 =0,1,...}. 
(ii) If 
{uje(ai,-.-, 2b) |g =1,2,...,9e(R)} 


is a basis of the vector space I,(R) of the proper multilinear polynomials 
of degree k in F(R), then P,(R) has a basis consisting of all multilinear 
polynomials of the form 


Lp, -.-€p,_,Ujk(@q,,---,Lq,), 9 =1,2,...,%e(R),& =0,1,2,...,n, 
such that py <...< Pn—p and qi <...< qk. 


Proof. (i) Let w)(a1,-..,%m) € Bm be homogeneous preimages of the ele- 
ments w;(@1,...,@m) € Bp(R), 7 = 1,2,... We choose an arbitrary homo- 
geneous basis {v, | & = 1,2,...} of Bn AT(R). Then 


{wi (@1,---5@m), 0k | J = 1,2,...,4 =1,2,...} 


is a homogeneous basis of B,,. Applying Proposition 4.3.3, we see that Fi, (R) 
is spanned by 


a a ; 
ay... com w;(@1,...,2m),a > 0,7 =1,2,... 


and these elements are linearly independent. The proof of (ii) is similar. 
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Theorem 4.3.12 Let R be a unitary PI-algebra over an infinite field K. 
(i) The Hilbert series of the relatively free algebra F(R) and its proper 
elements By,(R) are related by 





Hilb(Fin(R), t1,---,tm) = Hilb(Bm(R),t1,.-.,¢m) [] u 


1 
(l—t)r 


(ii) The codimensions and the proper codimensions of the polynomial iden- 
tities of R and the corresponding formal power series are related as follows: 


Hilb( Fn (R), ¢) = Hilb(By,(R),t) 


nm 


en(R) = > ({,) (2 n=0,1,2,..., 


k=0 


1 t 
c(R,t) = Toph To 


é(R,t) = exp(t)4(R, t). 


), 


Proof. The assertion of (i) follows immediately from Theorem 4.3.11 and 
Exercise 4.1.6. The first statement of (ii) also is a consequence of Theorem 
4.3.11 and the equations involving formal power series can be obtained by 
easy manipulation of the first equation of (ii). 


5. Polynomial Identities of Concrete Algebras 


Now we shall apply the results of Chapter 4 to study the polynomial identi- 
ties of the Grassmann algebra and of the algebra of k x & upper triangular 
matrices. Using other methods, the polynomial identities of the Grassmann 
algebra were described by Krakowski and Regev [155], together with some 
additional information. For further development see the papers by Berele and 
Regev [32] and Luisa Carini [45], e.g. for the Hilbert series of the correspond- 
ing relatively free algebra. Using methods similar to ours, another proof was 
given by Di Vincenzo [63]. The basis of the polynomial identities for the 
upper triangular matrices was found by Yu.N. Maltsev [181] in the case of 
characteristic 0 and by several other authors (e.g. Siderov [243], Kalyulaid 
[131], Polin [208], see the book by Vovsi [265]) in the case of any field. See 
also the book by Bahturin [21] for the Lie algebra case. A lot of information 
on the T-ideals, containing all polynomial identities of the algebra of k x k 
upper triangular matrices can be found in the paper by Latyshev [165], see 
also the paper by Drensky [76]. Apart from the Grassmann algebra and the 
upper triangular matrix algebras, over a field of characteristic 0, the bases of 
the polynomial identities of concrete algebras are known in few cases only. 
The most interesting algebras among them are the algebra of 2 x 2 matrices 
(Razmyslov [217] in 1973, and a minimal basis of the identities, Drensky [72] 
in 1981), the tensor square of the Grassmann algebra / @ F' (Popov [209] in 
1982). Over an infinite field of positive characteristic the known results are 
even less. Over a finite field one uses completely another technique and the 
best known results contain the bases of the polynomial identities for 1; (F,) 
for k = 2,3,4, respectively due to Yu.N. Maltsev and Kuzmin [182], Genov 
[111] and Genov and Siderov [113] in 1978-1982. We refer also to the survey 
article [77] for other applications of the methods of our exposition. 
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5.1 Polynomial Identities of the Grassmann Algebra 


In this section we shall describe the polynomial identities of the Grassmann 
algebra and their numerical invariants. We assume that K is a fixed field of 
characteristic 0. 


Lemma 5.1.1 Let G = ([x1,22,23])" be the T-ideal of K(X) generated by 
the polynomial identity [x1, x2, x3]. Then the polynomials 


[a1,vel[ae,e3] and [a1, xe][es, v4] + [ai, es][ae, x4] 


belong to G. 


Proof. We apply the identity 
[uv 2] = [u, who + alo, 0] 


(which expresses the fact that the commuting with a fixed element is a deriva- 
tion) and obtain from [a1, 23,23] € G that 


[v1, #3, 03] = [[e1, e2)e2 + wo[a1, v2], x3] = 
= [r1, 22, e3)%2 + [21, e2][v2, 23] + [r2, es][e1, 22] + vo[e1, 22,23] € G, 
[v1, ®2][2, v3] + [r2, es][v1, 22] € G, 
[v1, e2][v2, v3] + [v2, es][e1, v2] = 2[21, v2] [e2, v3] + [[2, vs], (21, r2]] © G 

and, since [[a, as], [%1, #2]] € G, this gives that 

[v1, €2][@2, v3] € G. 
The linearization of this polynomial is 

[w1, vo) [a5, x3] + [w1, 5][%2, x3] 


(which is the same as [21, €2][%3, v4] + [a1, e3][%2, e4]) and also belongs to G. 
Theorem 5.1.2 Let charkK = 0 and let E be the Grassmann algebra of an 
infinite dimensional vector space. 


(i) The T-ideal T(E) is generated by [x1, vo, ts]. 
(ii) The codimensions of the identities of F satisfy 


¢n(B) = 2"7',n=1,2,..., 


c(B,t) = So en (BE)? = a 


n>o 


~ t” 1 
e(B,t) = Yo en(B)G = 51 +e). 
n>o . 
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(iii) The Hilbert series of the relatively free algebra F,,(F) are 





1 lqa7let; 
HUb(Fm(E)stis stm) = 9 +5 TP 
1 $1/14+t\™ 
Hilb(F,,(£),t) = = + = | — 

ilb( Fin (E), t) 5+5(75) 


Proof. We shall apply the results of Chapter 4. Recall that B and I), are 
respectively the sets of all proper polynomials in A(X) and of all proper 
multilinear polynomials of degree n. Every T-ideal is generated by its proper 
elements. 

(i) Since the Grassmann algebra satisfies the identity [x1, #2, #3] = 0 (see 
Exercise 2.1.4), we obtain that ([1, x2, x3])? C T(E). Let 


w= [#i,,...,2,]---[@y,,---,e),] € K(X) 
be any product of commutators. Hence, if w ¢ ([21, 22, 23])?, then 
w= [Ti,, x2] tee [Tioga s Ving |- 


The anticommutative law [x,;,2;] = —[x;,;] and Lemma 5.1.1 give that, 
modulo the T-ideal ([z1, #2, 23])”, we can change the places of the variables 
in w. In particular, if two of a;, and x;, in w are equal, we obtain that 
w € ([a1,@2,%3])?. Hence B/(BN ([a1, 22, @3])7) is spanned by 


w= [ais , Cio] ---[Wiop_as Ving | Wy <to <1. << dop_d < lok, k= 0,1,... 


We shall prove the statement if we establish that any nontrivial linear com- 
bination of these elements is different from 0 in £&. Since w is uniquely deter- 
mined by its multidegree and TF) is homogeneous, we have to prove that 
w itself is not a polynomial identity for £. But 


leq, €9| see [eoe-1; eon] = 2 eres -.-€9--—1€9k # 0 


in # and this completes the proof. 
(ii) From the proof of (i) we obtain that [,(/) is spanned on 


[ai, r9| see [@on—1, Lor, n= 2k, 


and I,(£) = 0 if n is odd. Hence, if ¢, = 1 for n even and ¢, = 0 for n odd, 


then 1 
Yn(E) = diml, (FE) =e, = at +(-1)"),n=0,1,2,..., 


1 
pt= Des 


k>0 
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1 
4(E,t) = 5 (ei +e), 
and the proof follows from Theorem 4.3.12 (ii) by easy calculations. 
(iii) As in (ii), By, (Z) has a basis 
{[is, @io)-- [Pion 1 > Pion) | ES ty < ta <<... < ton-i < ton < mj. 


The Hilbert series of B,,(/) is equal to the sum of the elementary symmetric 
polynomials 


Scape (t1,---5tm) = , (Ie —t)+ Te +1) 


k>O0 t=1 i=l 


and we apply Theorem 4.3.12 (i). 


Exercise 5.1.3 Show that the polynomial identities of the Grassmann alge- 
bra E, of the k-dimensional vector space Vz, k > 1, follow from [#1, 72, #3] = 
0 and the standard identity so,(#1,...,¢2p) = 0, where p is the minimal 
integer with 2p > k. 


Hint. Show that s2,(#1,...,@2») = 0 is not a polynomial identity for E(V2,) 
and, modulo the T-ideal generated by [a1, x2, x3], the identity 
[v1, 22]... [Top41, Top+2] = 0, 


which holds for E(V2p41), is equivalent to the standard identity so,4. = 0. 


5.2 Polynomial Identities of the Upper Triangular 
Matrices 


Theorem 5.2.1 Let K be any infinite field and let U,(K) be the algebra of 
k x k upper triangular matrices. 
(i) The polynomial identity 


[%1,@2]...[@or—1, ox] = 0 


forms a basis of the polynomial identities of U;,(K). 
(ii) The relatively free algebra F(U,(K)) has a basis consisting of all prod- 
ucts 
eT OO i Baye Lin al [Pips Vins Vip eds 
where the number r of participating commutators is <k —1 and the indices 
in each commutator [%;,,,€iz,,--.,i,,,] satisfy t1s > tas <6. < tpys- 


Proof. We have seen in Exercise 2.1.9 that U,(A) satisfies the polynomial 
identity 
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[%1,@2]...[@or—1, ox] = 0 


and we shall show that all other polynomial identities for U;,(A) follow from 
this identity. We use the same idea as in the proof for the Grassmann algebra 
in Theorem 5.1.2. We shall see that modulo this identity every element of 
K(X) is a linear combination of the elements from part (ii) of the theorem 
and that any nontrivial linear combination of these elements does not vanish 
on the algebra U;,(K). Following our philosophy, we shall consider proper 
polynomial identities only. It is also convenient to work in the relatively free 
algebra 
F(&) = K(X) / (ea, £9] see [@on—1, won|)’, 


where, in this section only, we denote by Zy the variety defined by the poly- 
nomial identity under consideration. In other words, we work modulo the 
T-ideal 

T (Ze) = (faa, r9| wee [@on-1, avon])? 


For simplicity of the exposition we give the proof for the cases k = 2 and 
k = 3 only. The general case is similar. 


Let & = 2. Then in F(Z) 
[a 1, vel[ag, 4] = 0 
and B(%2) is spanned by 1 and by all commutators 
[i,,i.,---,2i,], 2 > 2. 
Using the identity 
0 = [x1, v2][e3, v4] — [e3, e4][e1, v2] = [[21, r2], [ws, v4]] = 
= [&1,@9,%3, v4] — [@1, Uo, Xa, xsl, 
we see that in (2) 
[Yi Y2,®o(t)s-- +s o(p)] = [Yi Yas ¥1,---, Bp], 7 E Sp. 


Additionally, the Jacobi identity and the anticommutativity allow to change 
the places of the variables in the first three positions: 


[v1, €2] = —|x2, x1], [@3, @2, 1] = [@3, @1, €2] ~~ [@2, 1, 3]. 
In this way, we can assume that B(Z2) is spanned by 1 and 
[@i1, Lin, Cig; . .., 2,]; a1, > tg < 13 <u..< in. 


We shall show that these elements are linearly independent modulo the T- 
ideal of U2(K). Let 


f(@1,...,&m) = SCaj(ri,,2i,,-.-, 2%] =0,% >ig<...< tin, a; EK, 
Fi 
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be a nontrivial polynomial identity for U2(A). We fix 4; maximal with the 
property a; # 0 and consider the elements 


Ti, = 1g + &:,€22, Fj = €je22, J AU, &i,,€6; EK. 
Concrete calculations show that 
f(#, oe ., Em) = ( S- ai€is os «,] €12 
iy fixed 


which can be chosen different from 0 because the base field K is infinite. Hence 
all coefficients a; are equal to 0 and this completes the proof for k = 2. 


Now, let & = 3. Then in F'(%3) 
[v1, v2][v3, ea][v5, x6] = 0 
and B(%3) is spanned by 1, all commutators 


[i,,i.,---,2i,], 2 > 2. 


and all products of two commutators 


[igs Pin, a 5 €i,|[@ ji, Bio, 7e 2 jq]. 


Applying the identity 


(Iv. vel, (ys, yal, v5) = (Lyn, ve, vs), Lys, yal] + [Ly1, ye], Lys, yas ysl], 


we see that [[y1,..-, Ya], [21,---, 26], #1,---, te] is a linear combination of prod- 
ucts of two commutators. As in the case k = 2 we see that B(%3) is spanned 
by I, 
[@i,,@i,,---, 2, ], a1, > tg <u. tn, 
[ips Vins Ci, ][%7,,Bjo,--- &], uy > 19 < te < tp y i > jo < se <Jqs 


and it is sufficient to show that these elements are linearly independent. 
Considering a linear combination and replacing 21, %2,... by 2 x 2 upper 
triangular matrices (regarded as 3 x 3 upper triangular matrices with zero 
entries in the third row and in the third column) we may assume that 


f(e., . ..;€m) = S- aij ltiy, tin, . 5 @i [tj 1, @ jo, _ 5&5] = 0, ays EK. 


Now we consider a maximal pair (#1, j1) with a;; # 0 (first maximal in 7, 
and between all such pairs, maximal in j,). Let 


Bi, = €12 + &:, €29 + mM, €33, 23, = €23 + €j,€22 + 75, €33, 


xj = €€29 + LE33, i # M5515 15 6515 Ely Mis Mj M EK. 


If 7, = j1, then we assume that 
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Ei, = €12 + €23 + &, e209 + Hi, €33- 


Again, 
FB 15.25 Bm) = (YO ai;Sia Kip (in — Ee) (i, — Ei) €19 


and this can be made different from 0. Therefore the above products of com- 
mutators are linearly independent modulo the polynomial identities of U3(K’) 
and this completes the proof of the case k = 8. 


Remark 5.2.2 Let R be a finitely generated Pl-algebra satisfying a nonma- 
tric polynomial identity (i.e. an identity which does not hold for the 2 x 2 
matrix algebra Ms(A‘)), the field A being infinite. A result of Latyshev [162] 
from 1966 gives that R satisfies some polynomial identity of the form 


[ay, r9| tae [@on-1, LK] = 0. 


From this point of view the polynomial identities of the upper triangular 
matrices serve as a measure how complicated are the polynomial identities of 
R. Nowadays the theorem of Latyshev is a direct consequence of the classical 
theorem of Amitsur that over an infinite field the T-ideals of the matrix alge- 
bras are the only prime ideals (see the proof e.g. in the book by Rowen [231]) 
and of the more recent theorem of Razmyslov-Kemer-Braun [220, 140, 38] 
that the Jacobson radical of every finitely generated Pl-algebra is nilpotent. 


Exercise 5.2.3 Let L/L, and L/L” be respectively the free metabelian 
Lie algebra of rank m and of countable rank over an infinite field. Let 


U(K)) be the Lie algebra of 2 x 2 upper triangular matrices. 
(i) Show that L/L” has a basis 


{i,[ti,,27,,---,@i,] |@=1,2,..., 1 > tp Sig <1. Sig, n > 2}. 





ll ow that the metabelian polynomial identity 
ii) Sh hat th beli 1 ial identi 
[[v1, v2], [x3, ea]] = 0 


forms a basis for the identities of the algebra U2(K)‘). 

(iii) Calculate the codimensions of the identities of Ue(K)O). Calculate 
the ordinary and the exponential codimension series. 

(iv) Calculate the Hilbert series of Ly,/L",. 


Hints. (i), (ii) Repeat the arguments of the proof in the associative case. 
(iii) Answer: 


e1(U2(K)O) = 1, en(U2(K)O) =n-1,n>1, 
e(U2(K),t) =t4+ (t—leb +1. 


(iv) Consider the vector subspace W of K(x,...,%m) with basis 
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{ai (wi, ...%7,) | tg <0. Stn n > 2}. 
Its Hilbert series is 


(1) + ...4+ tm) (Hilb(K[r1,...,¢m],t1,.-.,tm) -1) = 


= (t) +...+tm) (1--). 


Let W, be the vector space of the polynomials without constant and linear 
terms and let Wo = Li, /L", be the commutator ideal of Dp,/Ly,. Define a 
linear mapping ¢: W > W'@W"” by 





(44,25. a .&5,) = Xj, Ujg.. Li, E Wi, if 44 < te, 
(€4, 25. a .&5,) = [@i,, Lis, . 5%, ] EWs, if 41 > do. 


Show that ¢ is an isomorphism of graded vector spaces. Derive from here 
that 


Hilb(Lin/L!,,t,...,tm) = Sr + (se - 7 I —. 
t=1 t=1 t=1 : 


Exercise 5.2.4 Let G be the two-dimensional nonnilpotent Lie algebra over 
a field of characteristic 0. (If G has a basis {a, 6}, then, up to an isomorphism, 
the multiplication in G is given by [a,6] = a.) Show that G generates the 
metabelian variety of Lie algebras, i.e. the T-ideal of G is generated by the 
metabelian identity. 


Hint. Show that G satisfies the metabelian polynomial identity and the n— 1 
multilinear elements of degree n > 1 in Exercise 5.2.3 (i) are linearly inde- 
pendent in F'(G). 


Exercise 5.2.5 Let R be the algebra of the 3 x 3 upper triangular matrices 
of the form 


AN * x 
Or *], 
0 0 Xr 


where the *’s denote arbitrary elements of the field, and let chark = 0. Show 
that 

T(R) = ([z1, ©2, x3], Sa(1, 2, %3, w4)). 
Is this algebra isomorphic to the Grassmann algebra of the two-dimensional 
vector space? 


Hint. Show that the algebra satisfies the given polynomial identities and use 
the scheme of the proof for the polynomial identities of £ in Theorem 5.1.2. 
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A possible way to show that the algebras R and (V2) are not isomorphic 
is the following. The vector subspaces of R and E(V2) spanned respectively 
by {e12, €13, €23} and {e1, e2,e1e2} are the only maximal nilpotent ideals of 
these algebras. In the ideal of #(V2) the square of every element is 0 and in 
the ideal of R there exist elements which do not share this property. 


Exercise 5.2.6 Let 


a(t) = YG. b(t) = 5. 


p20 : q20 


Show that 


a(t)b(t) = )~ (> (;) stn) —. 


n>o 


Exercise 5.2.7 Show that 


e(Un(K),t) =e S(t et +1 = ll — ((t — let +1)*). 


p=0 


Hint. Use the description of the multilinear elements of the relatively free 
algebra F(U,(K)) in Theorem 5.2.1 (ii), Exercise 5.2.3 (ili) and Exercise 
5.2.6. 


Exercise 5.2.8 Find a basis of the polynomial identities for the Lie algebra 
of k x k upper triangular matrices over an infinite field. For k = 3,4 find 
the codimensions of the T-ideal and the Hilbert series of the relatively free 
algebra. 


Hint. Try to find the basis of the identities 


([z1, x9], [v3, val], [t5, x6]] = 0, [[x1, x9], [v3, val], [t5, x6], [v7, x8] = 0, 


respectively for k = 3 and k = 4 without consulting books. For the basis of the 
identities for k > 4 see e.g. the book by Bahturin [21]. For the Hilbert series 
and the codimensions try to modify the solutions of the previous exercises. 


6. Methods of Commutative Algebra 


Many results on Pl-algebras generalize well known results of commutative 
algebra. On the other hand, methods of commutative algebra are very ef- 
fective in the study of Pl-algebras. In this chapter we give a background on 
commutative algebra including the Hilbert-Serre theorem for rationality of 
Hilbert series of finitely generated graded modules of polynomial algebras in 
several variables. We apply this theorem to varieties of algebras satisfying 
nonmatrix polynomial identities. We also give some idea of commutative and 
noncommutative invariant theory. There are many well written books on dif- 
ferent aspects of invariant theory and we have limited our considerations to 
a sequence of exercises and remarks, with references to books and expository 
articles. 


6.1 Rational Hilbert Series 


Definition 6.1.1 The generating function of the sequence ao, a1, @2,... of 
elements of AK’ is defined as the formal power series 


a(t) = S- Gynt”. 


n>o 


Exercise 6.1.2 Let f,, n = 0,1,2,..., be the sequence of the Fibonacci 
numbers defined by 


fo =f, = 1, fn42 = Engi t fay n=0,1,2,... 
(i) Show that its generating function is equal to 
nt”? = = 
=D hat = 
n>o 


(ii) Prove that f, = cia? + cea}, where ci,co are fixed numbers and 
a1, @2 are solutions of the quadratic equation £7 = € + 1. 


Hint. (i) Show that 
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F(t) = tf) +0 FO + fot (fi fot. 


(ii) Prove that all sequences ao, a1, d@2,..., satisfying anyo = Gnz1 tan, n= 
0,1,2,..., form a two-dimensional vector space with basis consisting of the 
two sequences 

1,aj,a7,...,7= 1,2. 
Exercise 6.1.3 Let ao, a1, a2,... be a sequence and let a(t) = )> 9 ant” be 


its generating function. 
(i) Show that the sequence satisfies a linear recurrence relation 


Antp = Pidn4p—-1 +... + Bp-14n41 + Bpan, n= 0,1,2..., 


(for some fixed numbers ,..., 3,) if and only if the generating function a(t) 
is rational. 

(ii) Let the sequence satisfy the above recurrence relation and let the 
different zeros of the equation 


EP = BEPE Ft Bp 2k? + Bp1€ + Bp, 

be a,,...,@,% with multiplicities respectively equal to d,,...,d,. Show that 
a, = (n)at +...+¢n(n)az,n=0,1,2,..., 

where each ¢;(n) is a polynomial in n and dege;(n) < d; —1,¢=1,..., 


Hint. Repeat the arguments of Exercise 6.1.2. 


Exercise 6.1.4 If f(t) € C[t] is a polynomial such that f(Z) C Z, prove that 


3 


f= Trtlt= VD ...(€= (k= 1), ay €Z. 


Recall that a vector space M is a (left) module of the algebra A if an al- 
gebra homomorphism A —> Endx (M) is given. (Since we work with unitary 
algebras, we assume that the unity of A maps to the identity linear oper- 
ator on M.) The commutative algebra is noetherian if its ideals satisfy the 
ascending chain condition: For every infinite sequence of ideals 


I, ChCIsC... 


there exists an integer n such that 





Ln Int In42 





Hilbert Basis Theorem 6.1.5 Jf the commutative algebra A is noetherian, 
then the polynomial algebra A[z] is also noetherian. 
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Exercise 6.1.6 (i) Find in the books (e.g. in the book by Lang [161] or 
Atyiah and Macdonald [16]) and read the proof of Hilbert Basis Theorem 
6.1.5. 
(ii) Prove that any finitely generated commutative algebra is noetherian. 
(iii) If M is a finitely generated module of a finitely generated commuta- 
tive algebra A, then every A-submodule of M is also finitely generated. 


Exercise 6.1.7 Let A = K[a1,...,%m] and let S be the subalgebra of all 
symmetric polynomials in A. Show that A is a finitely generated S-module. 


Hint. Let e; = e;("1,...,%m) be the 7th elementary symmetric polynomial. 
Then for a fixed k, 
[[@ —a)=aP— eat +...4+ (-1)"lem—18 + (-1)"em = 0 
i=l 


and A is generated as an S-module by the products af! ...%2",0 <a, <m. 


Definition 6.1.8 Let M be a module of the graded algebra A = 57,,,, AM), 
One says that the module M is graded if M is a graded vector space with the 
same grading as A, (i.e. M = 37,5, M*)) and 


AM MO) Cyt), 


Hilbert-Serre Theorem 6.1.9 (See [16].) Let A = K[a,...,@m] be the 
polynomial algebra in m variables graded by assuming that the variable x; 1s 
of degree dj, i=1,...,m. LetW=) 7,55 W) be a finitely generated graded 
A-module. Then the Hilbert series of W 


Hilb(W,t) = So dimw(™4" 
n>o 


ts a rational function of the form 


mo 4 
Hilb(W,t) = f(t) [J Tom’ 


i=l 


where f(t) © Zt]. 


Proof. Since W is a finitely generated graded A-module, for each n the homo- 
geneous component of degree n of W is finite dimensional. Hence the Hilbert 
series of W is well defined. We apply induction on the number m of the gen- 
erators of A = K[a1,...,@m]. First, let m = 0. Then A = K and W isa 
finite dimensional graded vector space. Therefore, the Hilbert series of W is 
a polynomial. 


62 6. Methods of Commutative Algebra 


Now, let m > 0 and let the statement of the theorem be true for m — 1. 
The element x,, acts on W by multiplication, which (in this proof only) we 
denote by yz, i.e. p(w) = ew, w © W. The restriction pz, of on W isa 
linear transformation of W‘) into W("+4~), Let 


U™ = Kerpn, Imp = p(W) 
be respectively the kernel and the image of ju,,. Let 


U= S- UU”) T= S- Trtdn) where Ttdm) = Wirtdm) Tmpin. 
n>o n>o 


Both U and T are finitely generated A-modules, because U is a submodule 
and T is a factor module of W (see Exercise 6.1.6 (iii)). Clearly, 


tm”) = &mKerpn = 0, 


tml ete) = tm(Wt4~) /Tmptn) = Impnsa,, /IMpnsa,, = 0. 


Hence the action of «,, on U and on T is trivial (i.e. UV and T are annihilated 
by «»,) and this means that U and T are graded A [a1,...,%m-—1]-modules. 
Since 

W /UM = Imp, Tt) = Weter) Tmpin, 


we obtain that 





dimW”) — dimU™ = dim Imp, = dimw (+4) — dimT(@ +4), 
Multiplying this equation by t”+¢™ and summing with respect to n we get 
t¢™ (Hilb(W, t) — Hilb(U, t)) = Hilb(W, t) — Hilb(T, t) — g(t), 
where 
dm—1 
g(t) = S- dimw(™)4" 
n=0 
is a polynomial in n with integer coefficients. Hence 
(1 — t*™ )Hilb(W, t) = Hilb(7, t) — ¢¢ Hilb(U, t) + g(t). 


Since U and T are finitely generated K[a1,...,&m—1]+modules, we apply the 
inductive assumption and obtain the desired result. 


Exercise 6.1.10 If in the notation of Hilbert-Serre-Theorem 6.1.9 all vari- 
ables x; are of first degree, show that for n sufficiently large, dimW) is a 
polynomial in n (with rational coefficients) of degree less or equal to m — 1. 
Usually this polynomial is called the Hilbert polynomial of W. 


Hint. Let f(t) = )72_, axt®. Use that dimW”) is equal to the coefficient of 
t” in f(t). —t)~™ and 
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qo En) 


n>o 
Hence » 
—1 
dimw => ak ("en on 1") 
k=0 
for n > p. 


Exercise 6.1.11 In the notation of Exercise 6.1.10, let the order of the pole 
at t = 1 of the Hilbert series Hilb(W,t) = f(t)(1 — t)™ be equal to d. Show 
that the degree of the Hilbert polynomial of W is equal to d— 1. 


Exercise 6.1.12 Generalize Hilbert-Serre Theorem 6.1.9 to the case when 
W is a finitely generated multigraded K[a1,...,@m]-module and prove that 
there exists a polynomial f € Z[t1,...,tm] such that 





Hilb(W,t1,...,¢m) = f(ti,...,tm TIS 


6.2 Nonmatrix Polynomial Identities 


In this section we give some applications of commutative algebra to the the- 
ory of Pl-algebras. The following result is due to Drensky [75] and is based 
on the idea of Krasilnikov [156] to consider some T-ideals as modules over 
polynomial algebras. The result is also valid if we consider nonunitary alge- 
bras. 


Theorem 6.2.1 Let K be an infinite field and let W = T(R) be the T-ideal 
of the identities of an algebra R satisfying for some k < 1 the polynomial 
identity 
[ay, r9| tae [@on-1, LK] = 0. 
Then for any m > 1 the Hilbert series 
Hilb(F, => dimFin 


n>o 


of the relatively free algebra 
Fy (R) = K(y,...,@m)/(K(t1,...,@m) NW) 


is a rational function of t. 
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Proof. Let Wy be the T-ideal of K(x1,...,%m) generated by the polyno- 
mial identity [1,22] ...[vo4%-1, 224] = 0. This means that, as an ideal, W, is 
generated by all elements 


[ui, Uo]... [ten—1, ton], wi € A(0,...,¢m). 


For simplicity of notation we write W instead of K(a1,...,%m)7W. We 
apply induction on k. The base of the induction, & = 1, follows from Hilbert- 
Serre Theorem 6.1.9, because F,,(R) is a graded homomorphic image of 
K[a1,...,®m]. (By Exercise 4.3.7, if we consider unitary algebras, Mi, (R) 
is isomorphic to K[#1,...,%m].) Now, let k > 1 and let W D W,. Since the 
following isomorphism of graded vector spaces holds 


Fy,(R) = K(a1,...,%m)/W = 


= K(01,...,tm)/(We-1 + W) © Wye_-1/(We_-1 NW), 


it is sufficient (why?) to assume that W,_1 > W D> W, and to show that 
the Hilbert series of Wy_1/W is a rational function. We shall consider the 
case k = 3. The case k = 2 is simpler but does not give the idea about the 
typical difficulties. The general case & > 3 is similar to k = 3. In Theorem 
5.2.1 we have already seen that W2/Ws3 is spanned as a vector space by the 
polynomials 


— a1 a 
Uabe = @T tH [Wb Wbos + Vb, | [ers Wea +++ Le]: 


(We may assume that 6; > by < b3 <... < bp, 1 > ep < 03 <1. < ey, 
but we do not need this fact.) We consider the polynomial algebra A = 

















Kl&,m,G | ¢ = 1,...,m] in 3m variables and define an action of A on 
W./Ws by 

Ep eat. 20" [ab , Boo, - ++ €b, | [Bers Beas +--+ Leg] = 

= agate [ao @do, 5 €by|[@ers Peas +++ Leg], 

meat... 2h [ab , Loo) - +5 €b, | [Bers Beas +--+ Leq] = 

= att [Wy Bboy 5 Vy Vi] [@ey, Peas, Ley], 

Cie att. ane [ae @bos 5 €b, |e. Veg, ++) Ce] = 

Sat. eh (tp, @by,-. 5 Ub, |[@eys Legs. +, Leys Li), 


i.e. €; multiplies from the left the polynomial way. by x;; nj and ¢; add the 
variable x; in the end of the first and the second commutator, respectively. 


We shall use that the following identities hold in W2/Ws: 
Yo(1) «+ Yp(ny L215 225 2o(3)s «++ Zo(p)|[U1, U2, Ur (a), -- +5 Ur(g)] = 


= 91 ..-Yn[21, 22, 23; -- +, Zp] [ter, Ua, U3, .--, Ua], 
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where p € S,, 0 € Sp, T € Sy and o and 7 fix 1 and 2. This means that the 
action of A on W2/Ws3 is well defined and W2/Ws is an A-module generated 


by a finite system of elements 





[Ti,, £;,][2is, ria], a4, ta, 23, i4 = 1, see ™, 


Since the base field is infinite, W/W is a graded vector subspace of W2/W3. 
Unfortunately, it is not obliged to be an A-submodule. We shall avoid this 
difficulty in the following way. Let f = f(a1,...,%m) € W/Ws, 


f= SS aber? A” [tb Mba @by|[@ers eas +++ ey], Mabe € KK. 
Let 2 be an arbitrary element of kK. We fix a variable x;. Since W is a T-ideal, 
F(vi t+ Blei, vij,.--,%m + Blem, vi]) € W/Ws. 
Direct calculations show that 
[er + Ser, ei], es + Bles, ei] = [eres] + Oler, es, ei) + Fler, ai], [es x], 
f(e1 + Blei, ¢i],.--,¢m + Blam, vi]) = F(1,...,a&m)+ 








+ S- abet y) ...@H (lb, , Loa, +++, Ly, Ville, Leg, +--+, Le, |+ 
+ S- abet)... @H (lb, , La, -- +5 Ly] [Boys Pegs «+5 Legs Vi] + 
+8? SS arert? AM [by Mba 65 @by Vad [@ey, Weg, Ley, Ci] = 


= (1+ B(m +6) +B? mii) * f(e1,..., 2m). 
Applying Vandermonde arguments (K is infinite!) we obtain that 


(ni + Gi) * f(a1,-..,@m) © W/W, mii * f(01,--.,&m) © W/W. 


Since 
&; * f(@1,...,@m) = ef (41,...,e%m) © W/Ws, 


we establish that W/W is a submodule of W2/Ws3 with respect to the action 
of the (finitely generated) subalgebra S of A generated by 


&,m +i, mG,?=1,...,m. 


Since 7; + ¢; and 7;¢; generate the algebra of symmetric polynomials in two 
variables ™,¢;, by Exercise 6.1.7 (and its hint!) we obtain that A is a finitely 
generated S-module. Now W2/W3 is a finitely generated A-module, hence 
W./Ws is finitely generated also as an S-module (why?). Therefore, its fac- 
tor module W2/W is also finitely generated. The conclusion of the theorem 
follows from Hilbert-Serre Theorem 6.1.9, since the usual grading of the free 
algebra is the same as the grading of S defined by 


degé; = deg(ni + ¢i) = 1, deg (nici) = 2. 
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The proof of the following corollary of Theorem 6.2.1 follows immediately 
from the theorem of Latyshev (see Remark 5.2.2) that any finitely generated 
Pl-algebra which satisfies a nonmatrix polynomial identity, satisfies also for 
some k > 1 the polynomial identity 


[ay, r9| tae [@on-1, LK] = 0. 


Although we do not know whether R itself is finitely generated, the relatively 
free algebra F(R) is m-generated and we may apply the theorem. 


Corollary 6.2.2 If the base field K is infinite and the algebra R satisfies 
a nonmatrix polynomial identity, then the Hilbert series Hilb(F,(R),t) is a 
rational function. 


Exercise 6.2.3 Generalize Theorem 6.2.1 and Exercise 6.2.2 and prove the 
rationality of the Hilbert series Hilb(Fy,(R),t1,...,¢m), where the base field 
is infinite and the algebra R satisfies a nonmatrix polynomial identity. 


Remark 6.2.4 The action of the polynomial algebra in the proof of The- 
orem 6.2.1 was first used by Krasilnikov [156] to show that over a field of 
characteristic 0, every Lie algebra which satisfies the polynomial identity 


[[a1,2],-.-, [@2r—1, vex]] = 0 


has a finite basis for its polynomial identities. His proof is based on the follow- 
ing approach. Let W, be the T-ideal of the free Lie algebra L,,, consisting of 
the polynomial identities in m variables of the Lie algebra of k x & upper trian- 
gular matrices. Then for the T-ideals W C L,, such that W,_1 > W D Ws, 
the vector space W/W, is a submodule of a finitely generated module of a 
finitely generated commutative algebra (as in Theorem 6.2.1). Besides, Krasil- 
nikov used that over a field of characteristic 0, modulo the T-ideal T(R) of 
the polynomial identities of a finite dimensional algebra R, every polynomial 
identity is equivalent to a system of polynomial identities in not more than 
dim variables. Finally, since Wz is the T-ideal of the Lie algebra of k x k 
upper triangular matrices, it is sufficient to consider m = $n(n + 1), the 
dimension of this algebra, and to apply the fact that the T-ideals W such 
that W,_1 D W D W, satisfy the ascending chain condition (and the ideal 
W,, itself is finitely generated as a T-ideal). The same approach works for 
associative algebras (charAk = 0) satisfying the polynomial identity 


[ay, r9| tae [@on-1, LK] = 0. 


The associative case was initially handled by Latyshev [164] and Genov [110, 
112] in 1976 using another technique. As we have already mentioned in Chap- 
ter 3 (see Theorem 3.1.4), in 1987 Kemer proved that every associative algebra 
over a field of characteristic 0 has a finite basis for its polynomial identities. 
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Recently, Theorem 6.2.1 has been generalized by Belov for all relatively 
free associative algebras. 


Theorem 6.2.5 (Belov [26]) For any relatively free associative algebra Fy,(R) 
over an infinite field K, the Hilbert series Hilb(/,(R),t) ts a rational func- 
tion of t. 


The proof of Belov uses ideas of commutative algebras similar to these in 
the proof of Theorem 6.2.1 and is essentially based on the results of Kemer 
on structure theory of T-ideals. (See Chapter 8 for a short description of the 
results of Kemer.) In particular, Belov uses the positive solution of the Specht 
problem in characteristic 0 and some weaker version of this result in positive 
characteristic. 


Exercise 6.2.6 Show that if a T-ideal W in the free Lie algebra L over an 
infinite field AK contains the polynomial identity 


[[v1,@2],..-, [@2n-1, ®2x]] = 0 


then the Hilbert series of Lm/(Lm MW) is rational. 


Hint. We give hints for & = 3 only in order to use the notation of the proof 
of Theorem 6.2.1 (e.g. Ws is an ideal of the free associative algebra of rank 
m). Assume that Dm C K(a1,...,%m). Consider the case W D> Wo. Let 


So=K[yi + Gi, mdi [t= 1,...,m]. 


Show that the set of the proper polynomials in W2/Ws3 is a finitely generated 
So-module and (Lm NW2)/(Lm Ws) and (W + W3)/Ws are its submodules. 
Hence (Lm 1 W2)/W is a finitely generated graded Sp-module. 


6.3 Commutative and Noncommutative Invariant 
Theory 


In this section we give some idea of classical (commutative) invariant theory 
and its noncommutative generalization. If the reader is interested in this 
topic, we forward him or her to the books by Dieudonné and Carrell [60], 
Springer [246] and Sturmfels [247] (for classical invariant theory), to the book 
by Formanek [108] for the invariant theory of matrices, and to the book by 
Kharchenko [147] and the survey articles by Formanek [106] and Drensky 
[84] for the noncommutative generalizations. Till the end of the section we 
assume that the base field K is algebraically closed (although most of the 
considerations do not require this) and of characteristic 0. 
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Exercise 6.3.1 Let C be a commutative algebra generated by a finite set 
{c1,..-;¢m} and let S be a subalgebra of C' such that the generators of C 
are integral over S. This means that for each c; there exists an n; € N and 
az; € S, such that 


n ni-l 
c;' + aie; * +... 4+ Qj n,-1C) + Qin; =0. 


Show that the algebra S is finitely generated. 


Hint. Let So be the subalgebra of S generated by all coefficients a;; appearing 
in the above equations. Show that every power of ¢; is in the Sp-submodule 


of C generated by 1,c;,..., ct, Hence, as an So-module, C' is generated 
by the products cf?...ch™, 0 < p; <n; — 1. Since So is noetherian (as 
a homomorphic image of K[x,;; | j = 1,...,mi,¢ = 1,...,mJ), every So- 


submodule of C is finitely generated. Let S be generated as an So-module by 
b,,...,6,. Then, as an algebra, S is generated by all a;;, bg. 


Exercise 6.3.2 Let C be a finitely generated commutative algebra, which 
is a finitely generated module of a subalgebra S. Show that S' is also finitely 
generated as an algebra. 


Hint. If C is generated as an algebra by c1,...,¢m, show that there exist 
positive integers n,,...,%m such that C is generated as an S-module by 
cht... .cP™, 0 < p; < n;. Then modify the proof of Exercise 6.3.1. 


Definition 6.3.3 Let C' be a vector space or an algebra and let G be a group 
which acts on C' as a group of automorphisms. (This means that we have a 
homomorphism G —+ AutC’.) We denote by C the set of the fired elements 
in C, Le. 


C?% ={ce€ C| g(c) =c for all g € G}. 


If G is a subgroup of the general linear group G'L,,(A’) with its canonical 
action on the vector space V,, with basis {a1,..., am}, we extend the action 
of G on the polynomial algebra K[21,...,@m] by 


g(f(t1,..-,%m)) = flg(a1),---,9(@m)), f © Klti,...,em],g EG. 


The algebra of invariants of G is the algebra K[x1,...,2%m]%. Similarly we 
define the action of GL,,(K) on the free algebra A(a1,...,2%m) and on rela- 
tively free algebras F,,(), where R is a Pl-algebra, and use the same notation 
K(a1,...,%m)@ and Fy,(R)° for the invariants of a subgroup G of GIm(K). 
(The action of GLy,(K) on F,(R) = K(21,...,%m)/T(R) is well defined 
because T(R) is a T-ideal.) 


Remark 6.3.4 In classical invariant theory usually one considers the action 
of GL» (K) on K™ and treats the polynomials in A [a1,..., 2%] as functions, 
i.e. the action of GL, (AK) on K[x1,...,&m] is given by 
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(@f)(€) =f '(©), f € Klei,.--, am], € = (€1,-..,&m) € K™. 


This means that V, = span{x1,...,@m} is the dual module of the GLy, (K)- 
module K™. We prefer our Definition 6.3.3 because it works better for non- 
commutative algebras. In order to fit it to the classical definition we have to 
replace K™ with its dual GL,(A)-module (K™)*. 


We start with the problem of finite generation of the algebra of invariants. 


Exercise 6.3.5 Let G be a finite group of automorphisms of a commutative 
algebra C’. Show that every element of C’ is integral over the algebra of fixed 
points C®, 


Hint. Let |G| = n. Show that for every cE C 


[[ = 90) =c faye" +...4an-1¢e+ an = 0, 
geG 


where a1,...,4, € C%. 


Exercise 6.3.6 Prove the theorem of Emmy Noether [194] that the algebra of 
invariants of a finite group G acting on the polynomial algebra in m variables 
is finitely generated. 


Hint. Apply Exercises 6.3.1 and 6.3.5. 


Remark 6.3.7 One of the most famous problems in classical invariant 
theory is the 14-th problem of Hilbert whether the algebra of invariants 
K[xi,...,%m]@ is finitely generated for any linear group (= subgroup of 
GLm(K)). The negative answer was given by Nagata [189]. An exposition 
of his result can be found in the book by Dieudonné and Carrell [60]. An ap- 
proach from the point of view of derivations of polynomial algebras is given in 
the book by Nowicki [196]. In the paper [211] (see also his book [212]), Popov 
described the groups allowing nonfinitely generated algebras of invariants. 


Exercise 6.3.8 Let U;(K) be the algebra of & x k upper triangular matrices. 
Show that for & = 2 and for any finite linear group G C GL», (K), the algebra 
of invariants F,,(U2(K))© in the relatively free algebra Fi, (U2(K)) is finitely 
generated. 


Hint. Let f,,...,f, generate the algebra K[a1,...,#m]@. Consider the ca- 
nonical homomorphism Fip(U2(K)) —> K[ai,...,%m]. If fi,...,fn are 
preimages of f1,..., fr, then 


1 “\G 
hy= iq S> (fi) € Fy (U2(K))*, 


gEG 
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i.e. we liftthe invariants of the polynomial algebra to invariants of Fy, (U2(K)). 
The invariants can be lifted also using Maschke Theorem 12.1.6 that ev- 
ery finite dimensional G-module is a direct sum of its irreducible submod- 
ules (how?). Write the elements of the commutator ideal F' = F|F, FF of 
F = F,,(U2(K)) in the form 


S- Mabe® i... U0" [5,,€o,|@i 25", Mabe € K. 
Show that F’ is a finitely generated S = K[y,...,Un,21,---;%m]-module 


with action defined by 


(yf yh eB 28") ae (att em [ap eo]ao .. 08m) = 


=A AP a em lary, ay ]ae om ht Ae", 
Show that any set of generators of the S-module (F’)% together with 
hi,..-,hn generates the whole algebra Fy,(U2(K))°. 


Exercise 6.3.9 Explain why the hints to Exercise 6.3.8 do not work for 
Fn(Ug(K))%, ke > 2. 


Exercise 6.3.10 Let F = F.(U3(K)) and let G = (g), where 


g(@1) = #4, g(t2) = —#o. 


Show that the algebra of invariants F® is not finitely generated. 


Hint. Show that F° is generated by x, and roz"a0,n = 0,1,2,... Let R be 
the subalgebra of F° generated by x, and rga}a2, n = 0,1,...,p, and let 
R= F“. As atrelatively free algebra, F is multihomogeneous. Hence 


Pp 
ptl _ i j k Ia 
L_k, | LQ = ) ) Ajj Re Te Leet |, aie CK. 
j=0 \itk=p—j41 


Therefore, this is a polynomial identity for the algebra of 3 x 3 upper trian- 
gular matrices. Show that this is not true. For example, verify this identity 
On @1 = €92, £2 = €12 + €93. 


Exercise 6.3.11 Show that if F(R), m > 1, is a relatively free algebra 
such that F,,(R)° is finitely generated for every finite linear group G, then 
R satisfies a polynomial identity of the form 


p-l 
L_xh xo + ) ) Qj jRl LL LL} = 0, age EK. 
j=o t+k=p—j 


Hint. Choose g € Glm(K), 
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g(2) = —&o, g(a) = a;, 14 2. 
Follow the instructions to Exercise 6.3.10. 


Exercise 6.3.12 Let R be the algebra of 3 x 3 upper triangular matrices of 
the form 


,a, PEK, 


ooe 


* 
B 
0 


S&S * * 


Find a finite linear group G, such that Fin (R)@ is not finitely generated. 
Hint. Use Exercise 6.3.11. 


Exercise 6.3.13 Let C = K[t]/(t®), ie. #@ = 0 in C, and let R be the 
subalgebra of the algebra M2(C’) of 2 x 2 matrices with entries from C, 
consisting of all matrices of the form 


a1(t) ta a(t) 
; aj; (t) EC. 
tao (t) a99(t) 


Show that Fj,(R)%, m > 1, is not finitely generated for some finite groups 
G. 


Remark 6.3.14 Kharchenko [146] proved that Fin,(R)%, m > 1, is finitely 
generated for every finite linear group G if and only if F,(R) is weakly noethe- 
rian, i.e. noetherian with respect to two-sided ideals. Lvov [173] obtained that 
this happens if and only if R satisfies a polynomial identity of the form in 
Exercise 6.3.11. There are many other equivalent conditions given e.g. in the 
survey articles by Formanek [106], Drensky [84] and by Kharlampovich and 
Sapir [148]. (Do not be afraid of the title of the latter article! It contains 
not only algorithmic problems of algebra.) Most of the above exercises are 
restatements or partial cases of these conditions. 


Remark 6.3.15 There is a large class of linear groups including the reductive 
and the classical groups which are infinite but nevertheless their algebras of 
(commutative) invariants are finitely generated. These groups satisfy the so 
called Hilbert-Nagata condition. See the book by Dieudonné and Carrell [60] 
for the commutative and the paper by Domokos and Drensky [68] for the 
noncommutative case. 


Remark 6.3.16 Considering the invariants of a finite linear group acting on 
the free associative algebra, the only case when the algebra K(21,...,%m)° 
is finitely generated is when G is cyclic and acts by scalar multiplication, Le. 
G = (g) and g(«;) = €x;, where €” = 1. Thisis a result of Dicks and Formanek 
[58] and Kharchenko [144]. On the other hand, Kharchenko [144] proved that 
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for G finite, there is a Galois correspondence between the subgroups of G and 
the free subalgebras F of K(21,...,2%m) containing K(x1,...,%m)°%. This 
correspondence is given by 


H > K(a1,...,%m)",H<G. 


Remark 6.3.17 If G is a finite linear group, then it is easy to see (prove it!) 
that the transcendence degree of K[xz1,...,&m]®@ (i.e. the maximal number of 
algebraically independent over AK elements) is equal to m. This means that 
K[21,...,€m]@ contains a subalgebra isomorphic to the polynomial algebra 
in m variables. The famous theorem of Shephard-Todd [236] and Chevalley 
[49] gives that 

K[a1,...,2m]° = Klni,..., am] 


if and only if G is generated by pseudo-reflections (i.e. diagonalizable linear 
transformations of V,, which have | as an m— 1-multiple eigenvalue). Typical 
examples of such groups are the symmetric groups which are generated by 
reflections (in the usual geometric sense). The corresponding noncommuta- 
tive analogue of this result was proved by Domokos [67]. He showed that if 
G is a nontrivial finite linear group, then Fin(R)° = Fin(R), m > 1, if and 
only if R satisfies the polynomial identity [r,, 72,23] = 0 and G is gener- 
ated by pseudo-reflections. This shows that the polynomial identities of the 
Grassmann algebra are very close to the commutative law. 


Remark 6.3.18 Concerning free Lie algebras, Bryant [40] showed that LG is 
never finitely generated (|G| > 1, m > 1). We have mentioned that one of the 
main differences between associative and Lie Pl-algebras is that the associa- 
tive PI-algebras have good properties which make them close to commutative 
and finite dimensional algebras and this is not always true for Lie algebras. 
Such an example can be also found in noncommutative invariant theory. If 
|G| > 1 and the Lie algebra R is not nilpotent, then Fi,(R)%, m > 1, is not 
finitely generated (see the paper by Drensky [83]). 


In the next several exercises we discuss the Hilbert series of the invariants 
of finite linear groups. We assume that G is a finite group acting as a group 
of invertible linear operators on a finite dimensional vector space W. Since 
every element g of G is of finite order and chark = 0, the matrix of g 
is diagonalizable (prove it!). We denote the eigenvalues of g by &(g), ¢ = 
1,2,...,dimW. 


Exercise 6.3.19 Let the linear operator ¢ of W be defined by 


1 
=— h. 
e= dh 


hEG 


Show that: 
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(i) P= ¢. 

(ii) An element w € W is G-invariant (i.e. g(w) = w for all g € G) if and 
only if w € Imd = o(W). 

(iii) dimW® = try (@), the trace of the operator ¢. 


Hint. (i)-Gi) Use that 


1D e= T= (Ls)avec 


hEG he hEeG 


(iii) Since ¢? = ¢, use that W = Ker¢d 6 Imé¢ and ¢ acts identically on Im¢. 


Exercise 6.3.20 Let g € GL,(K) be a diagonalizable linear operator acting 
on Vip, = span{x1,...,2m}. Let €1,...,&m be the eigenvalues of g. 

(i) Show that the eigenvalues of g acting on the vector space of all homo- 
geneous polynomials of degree n are equal to 


re Gem, apt... + dm = 7. 
(ii) Let W be a multihomogeneous finite dimensional vector subspace of 
the free algebra K(x1,...,2%m) and let W be invariant under the action of 
the general linear group. Show that the Hilbert series of W 


Hilb(W,t1,-.-5tm) = So dimW ore thm 


is a symmetric polynomial in t),...,tm. Prove that the trace of g acting on 
W is 
trw(g) = Hilb(W, &1,...,&m). 


Hint. (i) The statement is obvious if z1,...,%m are the eigenvectors of g. We 
may change linearly the variables (and g still will have the same eigenvalues) 
in such a way that to be in the desired situation. 

(ii) Apply to W the linear operator o € GL (KX) defined by o(a;) = oq), 
o € Sy», and, using that o(W) = W, prove that 


Hilb(W,t1,...,tm) = Hilb(W, ta(i)s-- -;to(m)): 


For the trace trw(g) apply the arguments of (i). 


Exercise 6.3.21 If G C GL m(K), show that for any relatively free algebra 
F = F,,(R) (including F = K[ax1,...,%m] and F = K(#1,...,%m)), the 
algebra of invariants F@ is graded (but not always multigraded). 


Hint. Wf f = Woo fas fi € F, then f € F% if and only if fr € F%, 
i=0,1,...,k. 
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Exercise 6.3.21 shows that the Hilbert series of the algebra of the invari- 
ants F' = F,,(R)© is well defined and 


Hilb( Fo, t) => dim(F nr) Gyn. 
n>o 


Exercise 6.3.22 Prove the following theorem of Formanek [106]. If G is a 
finite subgroup of GL», (A), the eigenvalues of g € G are &(g), i= 1,...,m, 
and F = F,,(R) is a relatively free algebra, then 


Hilb(F%, t) a — 5 Hilb( FE, (g)t,.--,€m(g)t). 


gEG 


Hint. Apply Exercise 6.3.20. 


In the next exercises we assume that G C GL,,(K) is a finite group. We 
denote by det(g) and tr(g) the determinant and the trace of g as an m x m 
matrix. The hint to all exercises is the same: Find the corresponding result 
which gives the Hilbert series of the relatively free algebra under consideration 
and apply the theorem of Formanek in Exercise 6.3.22. 


Exercise 6.3.23 Prove the Molien formula [185] 


1 
Hilb(K[x1,...,¢mJ@,t) = — > ———_. 
( ) ia] 2e dea = 


Exercise 6.3.24 Prove the theorem of Dicks and Formanek [58] 


1 1 
Hilb(K (21, ...,2m)%,t) = — S~ ———— 
a » Tt) 


Exercise 6.3.25 Prove that for the Grassmann algebra 


. det(1 + det(1 + tg) 
Hilb( Fim (E) cP ds dei( ty) det(1 — tg)’ 


Exercise 6.3.26 Show that if R satisfies a nonmatrix polynomial identity, 
then the Hilbert series Hilb(F,,(R)%,¢) is a rational function. (This means 
that, although the algebra Fi, (R)@ may be not finitely generated, its Hilbert 
series is still nice. The same result holds for relatively free algebras satisfying a 
nonmatrix polynomial identity and (not necessarily finite) groups G' satisfying 
the Hilbert-Nagata condition, see the paper by Domokos and Drensky [68].) 
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Exercise 6.3.27 Generalize the theorem of Formanek in Exercise 6.3.22 to 
the Hilbert series of the vector spaces of the fixed points of finite linear groups 
acting on GL» (A)-submodules and factor modules of K(a1,...,2m). 


Hint. Do not forget to prove that the submodules and factor modules of 
K(«1,...,€m) are multigraded vector spaces (Vandermonde arguments!). 


7. Polynomial Identities of the Matrix 
Algebras 


Since the matrix algebras are considered of great importance both for mathe- 
matics and its applications, from the very origins of the theory of PI-algebras, 
the polynomial identities of matrices have been an attractive object for study. 
We start this chapter with two proofs of the famous Amitsur-Levitzki theo- 
rem which states that the k x & matrix algebra satisfies the standard identity 
of degree 2k. We also discuss other polynomial identities for matrices. Then 
we introduce the generic matrix algebras which are important not only for 
PI-theory itself but also for numerous applications to other branches of math- 
ematics such as invariant theory and theory of division algebras. Many impor- 
tant results in PI-theory were established or their proofs were significantly 
simplified using central polynomials for the matrix algebras. Here we give 
two essentially different approaches to central polynomials due respectively 
to Formanek and Razmyslov. For further reading on all these topics we refer 
to the books by Procesi [213], Jacobson [128], Rowen [231] and Formanek 
[108]. 


7.1 The Amitsur-Levitzki Theorem 


Recall that the standard polynomial of degree n is 


Sn (@1,.-.,2n) = S- (signa) o(1) ---®e(n)- 
TES y 


It is multilinear and alternating, i.e. 
Sn(...,@i,...,%5,---) = Sn... 5,---,8i,--), Sal... Gi... Bi...) = 0. 


The Capelli polynomial in n skew-symmetric variables is 


dy(%1, se ey ny Y1y-- sy Yn+1) = S- (signa) yi ®o(1)Y2 on -Yn®a(n)Yn4+1- 
TES y 


Exercise 7.1.1 Show that the standard identity of degree n + 1 is a conse- 
quence of the standard identity of degree n. 
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Hint. Show that 
n+l 
Sn4i (a1, ny ®n41) = So (=1)) tein (a1, tty Li, ny En41), 


i=l 


where #; means that x; does not participate in the expression. 


Exercise 7.1.2 (i) Show that the & x k matrix algebra M;,(K) does not 
satisfy a polynomial identity of degree less than 2k. 

(ii) Show that M;, (A) does not satisfy the Capelli identity d,2 = 0 (but, 
as we have seen in Examples 2.1.3 (ii), it satisfies d,24, = 0). 

(iii) Show that if f(a1,...,%24) = 0 is a multilinear polynomial identity 
of degree 2k for the algebra M;, (A), then 


f(@1,..., an) = oSo4(%1,..., on), a EK. 


Hint. (i) If M),(4) satisfies a polynomial identity of degree n < 2k, then it 
satisfies also a multilinear identity 


f(®1,..-,8n) = S- AgXo(1)++-®o(n) = 0, a0 EK, 
TES y 


of degree n. Use staircase arguments: 
Fler, €12, €22,€23,---, Eng) = WeElp,; 


where p = q or p = q — | depending on the parity of n and ¢ is the identity 
permutation. Hence a, = 0. Modify these arguments for az, where o is an 
arbitrary permutation in S;,. 

(ii) Order the matrix units e;; in an arbitrary way 


€pigi> &pogar- ++) €D 2,2 : 
Show that 
dy2 (Epa, »€poqo.- ++) &p.2q,27 @lpis Sgipar +++) ay 2_yP,2? €q,21) = €11- 
(iii) Let 
f(@1,..., 8K) = ) Ag€o(1)++-Lo(2K) = 0. 
JES 2K 
As in (i), replace 21,..., ox by matrix units. For example, 
F(€11; €11; €12, €22, +++, €k-1,k, Chk) = (He + (12) )e1z = 0 


and, similarly, ag + a, = 0 if 7 is obtained from o by transposition of two 
consequent elements r(p) and r(p+1). Derive from here that a, = (signa)a. 
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There is a nice graph theoretic interpretation of the multiplication of 
matrix units which is very useful for simplifying of calculations. For any set 
of matrix units 

P= {éi; | (i, 3) €E T} Cc M;,(K) 
we define an oriented graph G = G(P) with set of vertices V(G) = 
{1,2,...,&} and set of oriented edges {(¢,7) | (¢,7) € 7}. Then a product of 
matrix units €;,;,...€:,;, is not zero if and only if the edges (71, j1),..., (4k, Jr) 
form a path in the graph G. 

For example, if & = 4, then Fig. 7.1 gives the oriented graph corresponding 
to the set of vertices (with their multiplicities) {e11, €11, €13, €14, €21, €32}. 


Fig. 7.1. The graph corresponding to {e11, €11, €13, €14, €21, €32 } 


If we denote by v1 and v2 the two different copies of e;,, then all paths which 
go exactly once through all edges are 


141313533525154 
141353323531 3154 
1333235313513154 


1314315352351754 
131353525153154, 
1935251315154. 








3 














Since the dimension of the matrix algebra M;,(K) is equal to k?, it satisfies 
the standard identity s,24, = 0. The following theorem gives the optimal 
degree of a standard identity for M;,(K). 


Amitsur-Levitzki Theorem 7.1.3 The k x k matrix algebra M,(K) sat- 
isfies the standard identity of degree 2k 


Son (%1, . ..;€2h) = 0. 


Together with Exercise 7.1.2, the Amitsur-Levitzki theorem gives that, 
up to a multiplicative constant, the standard identity is the only multilinear 
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polynomial identity of degree 2k for M,(K). There are five essentially differ- 
ent proofs of the Amitsur-Levitzki theorem. The original proof [11] is based 
on inductive combinatorial arguments; with some technical improvements it 
can be found for example in the book by Passmann [205, p. 175]. There is a 
graph theoretic proof of Swan [249] which treats the nonzero products of ma- 
trix units as paths of Hulerian graphs (oriented graphs such that there exists 
a path passing through all edges exactly once). Recently, the basic idea of 
this proof was used by Szigeti, Tuza and Révész [250] to give another proof 
and to show that the matrix algebras satisfy also other interesting polynomial 
identities. Kostant [152] gave a cohomological proof. Here we give two other 
proofs of the Amitsur-Levitzki theorem due to Razmyslov [219] and Rosset 
[230]. We start with the proof of Razmyslov. 


Exercise 7.1.4 Show that the validity of the Amitsur-Levitzki theorem for 
M;,(Q) implies its validity for M),() over any field K. 


Hint. If rp = ie 1 ale, af) Mek ,p=l,...,2k, are matrices in M;, (4K), 


then so4(r1,...,7aR) is a linear combination of 89x (Cigjs, . ++ Ciop jon) and is 
equal to 0 because we have assumed that the theorem holds for M,(Z) C 


Mg (Q). 


Exercise 7.1.5 Let the eigenvalues of the matrix a € M,(K) be equal to 
&,...,€% and let eg(&,...,&) be the g-th elementary symmetric polynomial 
in €,,...,€%. Then 


k 
a®+S°(-1 )teq( (f1,...,&)a*~4 = 0, 
g=l1 


tr(a?) = €7 +...4 €f. 


Hint. Apply the Cayley-Hamilton theorem to the matrix a written in its 
Jordan normal form. 


The Razmyslov Proof of Amitsur-Levitzki Theorem 7.1.6 [219] We 
give the proof for 2 x 2 matrices. The general case is similar with additional 
technical difficulties only. By Exercise 7.1.4 we assume that K = Q. Let 


The Newton formulas give that for g << k 


Pq — €1Pq-1 + €2Pq-2 See (—1)*"teg-1p1 + (-1)%qeq = 0. 


We can express €,(&1,...,€) as polynomials of i ...,&€). In our case 


eal E12) = E162 = 5((Es + &2)? ~ (C+) = 5 (HE. &2) — pals, &)). 
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For any 2 x 2 matrix a with eigenvalues €1, €2, the Cayley-Hamilton theorem 
gives that 
a’ _ €1(€1, €2)a + €9 (£1, E2)e = 0, 


a? — pilGi.€a)a + 50361, &) — pal Je = 0, 


a’ — tr(aja+ s(tr%(a) — tr(a*))e =0. 
Hence 1 
a tr(w)a + 5 (tr (x) — tr(«”))e =0 


is a trace polynomial identity for M2(K). We linearize this identity (prove 
that we can linearize not only ordinary but also trace polynomial identities!) 
and obtain the identity 


(vive + your) — (tr(yi)ye + tr(ye)y)+ 


1 
+5 ((tr(yi)tr(y2) + tr(y2)tr(y)) — tr(yiye + yoyr)je = 9, 
and, since 
tr(yi)tr(y2) = tr(y2)tr(y1), tr(yiy2) = tr(yoyi), 
we see that Mo(K) satisfies the multilinear trace identity 


f(y. y2) = (nye tyoys) — (tr(yi)ye+tr(ye)y1) + (tr (yi) tr(ye)—tr(yiye))e = 0. 


Now we replace y; and yz respectively by %5(1)€o(2) and &4(3)%o(4) and take 
the alternating sum on o € S4: 


0= So (signe) f(2o(1)2o(2), Bola) to(4)) = 


oES4 


=2 S- (signa) (®o(1)®o(2)®o(3)€o(4) — tt(%o(1)%o(2))®o(3)®o(4))+ 
oES4 


+ S3 (signe) (tr (wo (1) to(2y tt (to(ato(ay) — te(o(1)®o(2)®o(s)®o(ay)e)- 
oES4 


The trace is invariant under cyclic permutations, hence 
tr(%o(1)®o(2)) = tr(%o(2)Lo(1)); 


tr(®o(1)®o(2)®o(3)@o(4)) = tt(®o(2)®o(3)Po(4)®o(1))- 


On the other hand, in each of the pairs 
(7(1), 7(2),o(3), o(4)) and (o(2), o(1), 7(3), (4), 
(7(1), 7(2),0(3), o(4)) and (o(2), o(3), o(4), 7(1)) 
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the permutations are of different parity and the summands containing traces 
vanish in 


Yo (signe) f(vo(1)o(2), Zo(a)®o(4)) = 0. 


oES4 


Therefore, we obtain that 


284 = 2 S- (signa) ao (1)€o(2)®o(3)®o(4) = 0, 
oES4 


and this completes the proof. 
Now we give the Rosset proof of the Amitsur-Levitzki theorem. 


Lemma 7.1.7 Let C' be a commutative algebra over Q and let a beak xk 
matrix with entries from C. If tr(a?) = 0 for g =1,2,...,k, thena =0. 


Proof. Let D be any field of characteristic 0 and let a € M;(D). As in the 
Razmyslov proof of Amitsur-Levitzki Theorem 7.1.6, 


k 
k_ S k-¢q 
a= Aga 5 
g=l1 


where qq is a polynomial without constant term and with rational coefficients 
in tr(a"), r = 1,2,...,¢. In particular, the same formula holds if the entries 
of a are from a polynomial algebra AK[Y] for some set Y (embed A[Y] into 
its field of fractions!). Now the algebra C' is a homomorphic image of some 
polynomial algebra (use that the polynomial algebra is free in the variety of 
all commutative algebras!). Let 6 : K[Y] —> C be a homomorphism. Then 
¢ induces a homomorphism ¢ : M;,(K[Y]) —> Mz (C) defined by $(a;;) = 
(e(a;;)). For any a € My(K[Y]) 


tr(¢a) = o(tr(a)). 
Therefore 
aq(tr(a),..., tr(a?))a*~7, a € M,(C). 


Q 
> 
I 
M4 


1 


q 
If tr(a?) = 0 for g=1,2,...,k, then a*® = 0. 


The Rosset Proof of Amitsur-Levitzki Theorem 7.1.8 [230] Again, 
we consider K = Q. Let EF be the Grassmann algebra generated by e1,€2,... 
and let E = Eo @ Ey be its canonical Zo-grading, i.e. Bo and Ey, are spanned 
respectively by the products e;, ...e;, of even and odd length. We have seen 
in Exercise 2.1.4 that Eo is a commutative subalgebra of EF. Let r1,..., rox 
be matrices in M;,(Q). Then 
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b=rie, t...+ rerear 


isa k xk matrix with entries from the (noncommutative) Grassmann algebra. 
Since e;e€; = —e;e;, we obtain that 


2k 2k 
a=b?= SoS rirjerey = Sorin; — ryri)(e1e;) 
t=1 j=1 t<j 


and a is a matrix with entries from the commutative algebra Ep. It is easy 
to see (check it for g = 2) that 


al= ) 82q(Tips +65 Ting Cty «+ + Cings tA << tq, 


tr(a’) = S tr(s2q(ri, Ley Ving) Cy + Chog) WW <...< log. 
As in the proof of Razmyslov we see that 
tr(Soqg(1,---;T2g)) = 9, T1,---, Pag © Me (Q). 
By Lemma 7.1.7 


a® = Son(T1,-.-,Tar)e1.--€2k = 0, Sox (r1,.-., Por) = 9, 


and this shows that the standard polynomial soz is an identity for M;,(Q). 


I think that now the reader is prepared to try to “solve” the following 
exercise. 


Exercise 7.1.9 Read the original proof of the Amitsur-Levitzki theorem [11] 
and the proofs of Swan [249] and of Szigeti, Tuza and Révész [250]. 


The following exercise contains a result obtained independently by Chang 
[46] and Giambruno and Sehgal [114]. It answers a problem risen by For- 
manek. 


Exercise 7.1.10 Answer in the affirmative the conjecture of Formanek that 
n = 2k is the minimal positive integer such that the matrix algebra M;,(K) 
(charK = 0) satisfies the double Capelli identity 


F(€15 0625 Uns ty Yn) = S- (sign(OT))@o(1)Yr(1) «+» ®o(n)¥r(n)- 
O,TESy 


Hint. (i) Consider the “double staircase” 
By = €11, 91 = €12, 2 = €22, Yo = €93,...,€k = Cee, 


Uk = kk, Ee4i = Ck jk—-1, UR41 = Ch-1,k-1)+++) ¥2k—1 = €21, YoR-1 = €11- 
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Using the graph theoretic interpretation of the matrix units and their prod- 
ucts, show that f(@1,...,¥2n-13 91,---; Yor—1) # 0. 

(ii) (We give the proof of Domokos [65]) Let ui,..., Wag, vi,..., Van be 
k x k matrices. Form the 2k x 2k matrices 


0 0 0 
unas = (4 iW) em= (4 Df beet 
qd 


Since the product w,;w; is not zero only if 7 and 7 are of different parity 
(why?), we obtain that 


_ F(t, ..., UaRs U1, ---; Var) 0 
supers os teas) = ( 0 FOr, .. 6, VaR) U1, +, Uae) 


and this is 0 in virtue of the Amitsur-Levitzki theorem for Mo,(K). 


The proof of Chang [46] is combinatorial and contains the stronger result 
that the double Capelli identity in Exercise 7.1.10 is a consequence of the 
standard identity sz, = 0. The proof of Giambruno and Sehgal [114] is based 
on the Rosset proof of Amitsur-Levitzki Theorem 7.1.8. I hope that the reader 
will try to read these two papers. They are short, well written and one really 
enjoys reading them. Another proof can be found in the paper by Szigeti, 
Tuza and Révész [250] (see the comments after Amitsur-Levitzki Theorem 
7.1.3). 

One of the central problems in the theory of PI-algebras is the following. 


Problem 7.1.11 Find a basis of the polynomial identities for the algebra of 
k x k matrices, k > 2, over a field of characteristic 0. 


The complete answer is known for 2 x 2 matrices only. In 1973 Razmyslov 
[217] found a basis of 9 polynomial identities of degree 4, 5 and 6. Using this 
fact Drensky [72] showed in 1981 that the polynomial identities 


84(@1, £2, 3,04) = 0, [[z1, x2]°, a1] = 0 


form a minimal basis of the polynomial identities of Mo(K). The Hall iden- 
tity in two variables [[x1, 22], 21] = 0 is a consequence of the identity of 
algebraicity in Exercise 2.1.8. There was a hope that the standard identity 
89; = 0 and the identity of algebraicity 


S- (signa) a7 Oy, a7) yo ype?) = 0 


TESp41 


form a basis for T(.M;,(K)) also for k > 2. Okhitin [198] constructed a poly- 
nomial identity of degree 9 for M3(A) which does not follow from these two 
identities. Domokos [66] found other new identities for 3 x 3 matrices. The 
most general known result about the identities of & x k matrices for any k is 
due to Razmyslov [219] and Procesi [214] who described the trace polynomial 
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identities. Freely restated, their result gives that all polynomial identities for 
M),(K) follow from the Cayley-Hamilton theorem. We have already seen par- 
tial examples of this fact in the Razmyslov proof of Amitsur-Levitzki Theorem 
7.1.6 and in the proof that the identity of algebraicity holds for the matrix 
algebra (Exercise 2.1.8). 

In the next several exercises and problems we assume that chark = 0. 


Exercise 7.1.12 Show that the identity of algebraicity for M,(K), k > 1, 
does not follow from the standard identity so, = 0. 


Aint. Let 


an (@,Y1,---; Yk) = S- (signa) a7 Oya? MY yo. -ypa?™) = 0 
TESp41 


be the identity of algebraicity. If it follows from the standard identity se, = 0, 
then 


b(a, y) = an (a, Y, oe 5 Y) = S > aitisoe (vi; oe £5 Vlog Wis age K, 


where u;,v;;,w; are monomials in K(a,y). We may assume that the total 
degree of uj, vj;,w; in y is k and in @ is $k(k +1). This means that in the 
standard identity sox (vi,,--., Ui, ) not more than & monomials vj, contain y 
and the others are positive (why? — sox (a1,..., 2%) is proper!) powers of x. 
Since the standard identity is skew symmetric, we obtain that all monomials 
vj, are different and this gives that the total degree of x is not less than 
1+2+...+k. Derive from here that all monomials v; containing y should 
be equal to y and, since k > 2, this is impossible. 


Exercise 7.1.13 Show that for k sufficiently large, (A) has polynomial 
identities in two variables which are consequences of the standard identity 
and are of degree less than the degree of the identity of algebraicity (with 


Y=... = Ye =y)- 


Hint. Order all monomials in two variables in K(x, y) by ascending degree, 
e.g. 








2 2 
U1 vt, U2 = Y, UZ =H, U4 = LY, U5 = YH, UG = Y ,.-- 


and, using “economically” these elements, try to construct a nonzero poly- 
nomial in K(x, y) of the form 


Son (Ue, ; sey Wing) i SC... lok, 
Try e.g. with so.(z,y, 27, ey, yr, y’,...) or with 


Son (2, y, TY, yx, 2 y, eyx, yx”, xy, 2° yx, yx, yr®,...) 
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Show that for & sufficiently large, the degree of the obtained identity is less 
than $k(k +1)+k, which is the degree of the identity of algebraicity. If there 
are some difficulties, see Exercise 7.4.8. 


Problem 7.1.14 Find the minimal degree of the polynomial identities in 
two variables for the matrix algebra M;,(K), k > 3. 


Problem 7.1.15 Find the minimal degree of the polynomial identities for 
M;,(4K) which do not follow from the standard identity so, = 0, k > 3. 


Leron [171] proved that for k > 2 all polynomial identities for M;,(K) 
of degree 2k + 1 follow from the standard identity. For k = 3 the same is 
true also for the polynomial identities of degree 8 = 2.3+ 2 (Drensky and 
Azniv Kasparian [91]). Since the identity of algebraicity is of degree 9, the 
minimal degree of the polynomial identities for M3() which do not follow 
from sg = 0 is equal to 9. 


7.2 Generic Matrices 


In this section we assume that the base field K is arbitrary and for the integer 
k > 2 we fix the notation 2 = 2; for the K-algebra of the polynomials in 
infinitely many commuting variables 


2 = Ky) |p,q=1,...k, t= 1,2,..]. 


Definition 7.2.1 The k x k matrices with entries from 2, 


k 
p,q=l 


are called generic k x k matrices. The algebra Ry, generated by the generic 
k x k matrices is the generic k x k matrix algebra. We denote by Rym the 
subalgebra of Ry, generated by the first m generic matrices y,,..., Ym. 


For example, for k = m = 2, changing the notation and assuming that 


usa? = Lpq, ys? = Ypq, the algebra Roz is generated by 


r= (3h me) a= ( me) 
G21 X29 Y21  Y22 
For any commutative K-algebra C’, the k x k matrices with entries from 


C' can be obtained by specializations of the generic matrices, e.g. 


k 
a= S- Ypa€pas eq E ©, 


p,q=l 
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is obtained from 


k& 
1 
y= » Us pa 


p,q=l 


(1) 


by replacing the variables ys with Ypq- 


Exercise 7.2.2 If the base field is infinite, show that the generic matrix 
algebra Ry is isomorphic to the relatively free algebra F'(M;,(K)) of the k x k 
matrix variety, 1.e. 


Ry & K(X)/T(My(K)). 


If K is a finite field and P is any infinite extension of AK, show that Ry = 


Hint. For any infinite field P containing K, show that the kernels of the 
canonical homomorphisms 


K(X) —> F(My(P)), K(X) — Rp 


coincide, i.e. f(@1,...,&n) € A(X) is a polynomial identity for M;,(P) if and 
only if f(y1,---,¥n) =O in Rp. 


Exercise 7.2.3 Let P = K[&] be an extension of degree k of the base field 
K (where € is a primitive element). Considering P as a k-dimensional vector 
space over K, let {m,..., 7} be a basis of P. Let € be the linear operator of 
P defined by E(a) = €a, a € P. Show that, up to a multiplicative constant, 
the characteristic polynomial of € coincides with the minimal polynomial of 
€ over K. 


Hint. If f(é) = 0, f(t) © Kf], then f(€) = 0. Use the Cayley-Hamilton 
theorem and the fact that the minimal polynomial of € is of degree k and is 
irreducible over K. 


Exercise 7.2.4 Calculate the characteristic polynomial 
fa(A) = det(a — Ae) 


of the matrix 


0 0 0 0 apo 

1 0 0 0 a 

0 1 0 0 az 
a= . . 

0 0 .. 1 0 Ak-~2 

0 0 .. O 1 Ak-1 


Answer. 
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fa(A) = (—1)* 8 = (agit + ago? +... Fad +.a0)). 


Lemma 7.2.5 The eigenvalues of the k x k generic matrix y, are pairwise 


different. 


Proof. We consider y; as a matrix with entries from the field of fractions of 
the polynomial algebra 2,. Let f(A) be the characteristic polynomial of y). 
Let f(A) have multiple zeros. Then the discriminant of f(A) is equal to zero. 
Since every & x k matrix a with entries from K is a specialization of y,, the 
discriminant of the characteristic polynomial f,(A) is also equal to 0 and this 
means that a has multiple eigenvalues. We shall prove the lemma if we find 
a matrix a € M,(K) without multiple eigenvalues. If A is infinite we can do 
this in an obvious way (how?). If AK = F,, then there exists an irreducible 
over F, polynomial g(A) € F,[A] of degree & and we may use Exercise 7.2.4 in 
order to construct a. Another possibility is to use Exercise 7.2.3 for P = IF yx, 
because the minimal polynomial of a primitive element of P has no multiple 
Zeros. 


Corollary 7.2.6 Let 2, be as above and let 
k 
vi = So ul epp, yj =y,t>1. 
p=l 


The algebra Ri, generated by y, y5, y3,-.. 1s isomorphic to the generic matrix 
algebra Ry. 


Proof. Let = be the algebraic closure of the field of fractions of the poly- 
nomial algebra 2,. By Lemma 7.2.5, the generic matrix y, has no multiple 
eigenvalues. Hence there exists an invertible matrix z with entries from = 


such that the matrix uj = z~!y,z is diagonal. Let 


uj =z tyz, i= 1,2,... 


Denote by Uy the A-subalgebra of M;,(5) generated by wy, t2,.... Clearly 
U, is isomorphic to Ry. Let 6: Ry —> Ri, and w : Ri, — U;, be the K- 
algebra homomorphisms extending respectively the maps do : y% — y; and 
Wo: y; A uj, t= 1,2,.... Since the matrices u; are obtained as specializations 
of the “generic” matrices y; (in the class of all sequences of matrices, first of 
which is diagonal), ¢ is a homomorphism. The composition ¢ : Ry —> Ux, 
is the isomorphism defined by y; 3 uj = z~!y;z. This implies that Keréd = 0 
and, since ¢ is onto Rj, it is an isomorphism. 


Corollary 7.2.6 allows to replace one of the generic matrices by a diag- 
onal generic matrix which will be very useful in the further considerations. 
Sometimes, if we consider a single generic k x k matrix y, we shall use Greek 
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characters for the diagonal entries of y and shall write e.g. y = eat Np Epp 


instead of y = Seat Ypp pp, assuming that m1,...,79%. are commuting vari- 
ables. 


Exercise 7.2.7 Show that we may assume that in R;, the first generic matrix 
is diagonal and the second has the same first row and column, e.g. 


k k 
_ 1 _ 2 
w= 5 yy pp; Y2 = 5 I?) erg, 
p=l 


p,q=l 


and yh? = yl, gq = 2,3,...,k. 


Hint. Conjugate the algebra Rj, in Corollary 7.2.6 with a suitable invertible 
diagonal matrix z = Seat En€pp € M,(=). Then apply the arguments of the 
proof of the corollary. 


Exercise 7.2.8 Let A be a finite dimensional algebra with basis {a1,...,@m} 
over an infinite field A and let 


Q4= KU |p=l,....m,i=1,2,..] 


be the polynomial algebra. Show that the “generic” algebra Ry generated as 
a K-algebra by the elements 


m 
a . 
“= ) Map, t= 1,2,..., 
p=l 


is isomorphic to the relatively free algebra F(A). (In order to define the 
multiplication in Ra, we assume that the polynomials of 24 commute with 
the basis elements of A and the multiplication between a,,...,@m 18 as in A. 
In other words, we consider R4 naturally embedded into the tensor product 


of K-algebras 24 @ A.) 


Hint. Apply the same arguments as for the generic matrices in Exercise 7.2.2. 


7.3 Central Polynomials 


Definition 7.3.1 Let R be an algebra. The polynomial c(x1,...,¢,) € 


K(X) is called a central polynomial for R if c(a1,...,¢@,) has no constant 
term, c(r1,...,7%) belongs to the centre of R for all ri,...,7, € R, and 
c(&@1,...,%n) = 0 is not a polynomial identity for R. 


Exercise 7.3.2 Show that the polynomial 
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(a1, £2) = [x1, ao]? 


is central for the matrix algebra Mo(K). 


Hint. Use the Cayley-Hamilton theorem and the equality tr([#1, ¢2]) = 0 (see 
Exercise 2.1.7 (ii)). Prove that over any (also finite) field there exist matrices 


a,@2 € Mo(K) with det([a1, a2]) 4 0. 


In 1956 Kaplansky [133] gave a talk where he asked several problems 
which motivated significant research activity in the next decades. One of his 
problems was whether there exists a central polynomial for the matrix alge- 
bras M;,, (4) for k > 2. A revised version [134] of the problems of Kaplansky 
appeared in 1970. 


Problem 7.3.3 (Kaplansky) Does there exist a multihomogeneous central 
polynomial for the matrix algebra M,(K), k > 2? 


The answer to the problem of Kaplansky was given in 1972-1973 by For- 
manek [104] and Razmyslov [218] and this was very important for ring theory. 
We refer to the books by Jacobson [128], Rowen [231] and Formanek [108] to 
see how some important theorems were established or simplified using cen- 
tral polynomials. There are two essentially different methods for construction 
of central polynomials due to Formanek and Razmyslov. We shall give both 
of them. Later, an alternative proof of the existence of central polynomials 
based on results of Amitsur was given by Kharchenko [145]. Combining ideas 
of Formanek and Razmyslov with other methods, various central polynomials 
were also constructed by Halpin [124], Drensky [85], Giambruno and Angela 
Valenti [115]. We start with the central polynomial of Formanek. 

Let K[ui,...,uUx4i] be the polynomial algebra in k + 1 commuting vari- 
ables. We define a linear mapping @ (not an algebra homomorphism) from 
K[ui,...,Ug4i] to the free algebra K(x, y1,..., ye) of rank & + 1 in the fol- 
lowing way. If 


g(ui,..., Upga) = So aut? - UN, aa € K, 
then 
O(9)(@, 15-25 Uk) = So ae ye? yore ys... .e* year 


Exercise 7.3.4 Show that for 
k& 
z= S- Ppepps Yq = Cigdg € M;,(4K), Pp € K, P,Q = 1, + .,k, 
p=l 


and for g(ti,...,Ueqi) © Alai,...,urqil, 
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O(g)(%, Hi, oe 5 Uk) = G (Pays Piss oe «Pins Pin Jeixgr ++ Cinjp: 


Aint. Use that LCig = Pieijz, CigT = Pjeij- 
Theorem 7.3.5 (Formanek [104]) Let 


g(uiy---s tet) = TY (er —up)(uegr— up) TP (up = uy)’. 


2<p<k 2<p<qsk 


The polynomial of the free associative algebra K(x, y1,..-, Yn) 


C(%,Y1,-+-5 Yk) = O(g)(@, 41, +--+, YR) + O(g)(%, Yay. es Yes YL) + 
+... + O(g)(®, Yes Yiy +) Ye—1) 


is a central polynomial for the matrix algebra M,,(K) over every field K. 


Proof. It is sufficient to show that the polynomial c(Z, ¥1,..., J) is a scalar 
matrix when &, j1,..., Y, are generic matrices (why?) and ¢(z, y1,..., yn) = 0 
is not a polynomial identity for M,(A). By Corollary 7.2.6, we may assume 
that the generic matrix z is diagonal, 


k 
r= 5 En €pps 
p=l 


where €1,...,€% are commuting variables. Since c(a, y1,..., yz) is multilinear 
IN Y1,-.-,Yk, we may also assume that y,,...,%, are matrix units. Hence 


C(B, ti, ---5 Yr) = ) OE, Cigigy <-> ings Cirgrs ++ +s Cig—rgq—1) = 
g=l1 


k 
= ) Eig . Gin Sins . 5 Giga Sjq-1) Cig 26 Cine tigi ++ Cig-igg-1° 
g=l1 
Since we have chosen g in a very special way, 


Eig s ++ Sins Sir 5 ++ +5 Sigs Sjq-1) = 0 


if some of the variables €;, and Gi are equal or if €;,_, is equal to some &;, 
with p # q. On the other hand, if 41,..., i, is a permutation of 1,...,& and 
tq = Jqg-1, then 


Eig s+ 6s Sins Sins + +5 Sig 1s Sig) = 
= 9(€1,69,...,€,61) = II (Ep — Ep)’, 


1<p<p'<k 


> 2 
OCG) (2%, Ci gigas Cigtiigas 15 Cipiny Ciyins ++ +5 Cig rig) = II (Ep — Ep') Cigig: 
1<p<p'<k 
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Therefore, 
c(#, Ciyta) Cigtgy +++) Cip_ites Cixi) = 
k 
_ 2 2 
= TT G~ 60)" does = TT  -S1)%e, 
it q=l l<p<q<k 
if 71, %2,...,%% 18 a permutation of 1 .,& and 
e(e, Cixjis Ciajars++s Cin jn) =0 
otherwise. Now, the only statement to prove is that c(#,y1,...,yz) = 0 is 


not a polynomial identity for M;, (A). If # is a diagonal matrix with different 
eigenvalues p1,..., px we see that 


c(®, €12, €93,---,€k-1,k, ek) = II (Pp — Pq)”e 
L<p<ash 


which is a nonzero scalar matrix in M;, (4). If K = F,, we consider a matrix 
@ € M,(EF,) with different eigenvalues. Then @ is diagonalizable over some 
Fyn. Hence ¢(@,%1,...,9%) # 0 in My(Fgm). Since c(%,H,...,%m) 1s a lin- 
ear combination (with coefficients in Fy») of ¢(@, €i,3,,---, inj.) © Me(Fo), 
some c(#, €;,;,,---, €i,3,) 18 different from 0. This completes the proof of the 
theorem of Formanek. 


Now we give the Razmyslov approach to central polynomials. It uses the 
notion of weak polynomial identities [217, 218] and of the Razmyslov trans- 
form [218]. See the book by Razmyslov [221] for account on his methods 
containing also applications to many other problems of the theory of PI- 
algebras, Lie algebras with polynomial identities and groups with identical 
relations. 


Definition 7.3.6 The polynomial f(#1,...,%,) € K(X) is called a weak 
polynomial identity for the matrix algebra M,(K) if f(a1,...,a,) = 0 for 
all traceless matrices a1,...,@) in M,(K) (i.e. a1,...,@n belong to the Lie 
subalgebra sl;,(A’) of My (K)‘-). The weak polynomial identity is essential 
if it is not a polynomial identity for M, (4K). 


Exercise 7.3.7 Show that [x?, 22] = 0 is an essential weak polynomial iden- 
tity for Mo(K). 


Hint. If tr(a,) = 0 then the Cayley-Hamilton theorem gives that a? is a scalar 
matrix. Find matrices a,a2 € M2(K), tr(a,) 4 0, such that [a?, a2] 4 0. 


Exercise 7.3.8 If f(%1,...,@,) = 0 is a weak polynomial identity for 
M;,,(K), then f([@1, @n41], [t2, &nga],---, [@n, an]) = 0 is an ordinary poly- 
nomial identity. 
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Hint. tr([a, 6]) = 0 for any a,b € M,(K). 
Exercise 7.3.9 Show that the Capelli polynomial 


dy2(a1, 223 UR25Y1,-- 5 Yk241) 


in k? skew symmetric variables is an essential weak polynomial identity for 
M),(4) which vanishes when the x’s are replaced by elements of sl, (A) and 
the y’s are arbitrary matrices of M;,(K). 


Hint. Since dimsl,(K) = k? — 1, the polynomial d;2 vanishes when the ’s 
are replaced by elements of sl,(A’). On the other hand, d,2 is not zero on 
M;,(K) (see Exercise 7.1.2). 


Exercise 7.3.10 (Halpin [124]) Let the base field K be infinite and let the 
symmetric group S, act on the set {0,1,...,4 —2,k}. Show that the poly- 
nomial 


wW(@,Yi,-- +) Yk-1) = S- (signe)27y, 27 yo page Daye 8) 
oeS, 


is an essential weak polynomial identity for M),(4) which vanishes for « € 
sl,() and y1,...,ye—-1 © Mg (K). 


Hint. The Cayley-Hamilton theorem gives that 
a® — tr(a)a*—! +...4+ (—1)* det (a)e =0,a€ M,(K). 
If tr(a) = 0, then 1,a,a?,...,a*~?,a* are linearly dependent and 
w(a,b),..., 64-1) = 0, b1,..., bg-1 © Ma (XK). 


If z= eet Pp€pp, Show that 





w(Z, €12, €93,---,€s-1,k) = Alpi,---, pa )eik, 
where 

1 1 1 a 1 1 

ra p2 P3 +++ Pk-1 Pk 

Pi Ps PSs Pea Pe 

Alpi, + +s Pk) = : : : . : . |= 

k-2 k-2 ke 

Pi P2 P3 “++ Pr-1 Pr 

pi ph Ph wee Phy PR 

=(et+-..+pn) [] (a-pp)- 





1<p<qsk 
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Remark 7.3.11 The notion of weak polynomial identity was introduced by 
Razmyslov for several purposes. One of them was to construct central poly- 
nomials. Another purpose was to study the ordinary polynomial identities 
for matrices. For example, one of the main steps of the Razmyslov result 
[217] that the polynomial identities of the 2 x 2 matrix algebra over a field of 
characteristic 0 follow from those of degree < 6 is that the weak polynomial 
identities of Mz(A) are consequences (in sense similar to that for ordinary 
polynomial identities) of the weak identity [27,22] = 0. The notion of weak 
polynomial identity was further generalized to the case of representations of 
Lie (and other) algebras. We restrict our considerations in this direction to 
the definition only. 


Definition 7.3.12 Let G be a Lie algebra with universal enveloping algebra 
U(G) and let 
@:U(G) — EndV 


be an associative algebra homomorphism of U(G) to the algebra of linear 
operators of a vector space V. Then @¢ is called a representation of G. A 
polynomial f(#1,...,@%n) € K(X) is an identity for the representation ¢ if 


for all g1,.-.,9n EG. 
If we consider the Lie algebra sl;,(A’) with its natural k-dimensional re- 
presentation (as & x k matrices with trace zero), we obtain the notion of weak 


polynomial identity. For further details on identities of representations and 
their applications one may see the books by Razmyslov [221] and Vovsi [265]. 


Exercise 7.3.13 Let a;,6; € My(K), i= 1,...,n, and let 


for all matrices u € M;, (A). Show that 


= So iu —0 
t=1 
for all wu € M;,(K). 


Aint. Let 
k 


k 
=e Og pas bi = do as Crs, be me eK, 


g=l1 rjs=l 


Then 
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k n 
flew) = 2 (>: oie) ee =0, 


p,s=l \t=l 


and for any fixed p,g,r,s=1,...,k, 


3 


This implies that f*(esp) = 0, s,p = 1,...,%, and hence f*(u) = 0 for any 
ue M,(K). 


Exercise 7.3.14 Define a symmetric bilinear form on the vector space V = 
M;,(K) by 
(a,b) = tr(ab), a,b © M,(K). 
(i) Show that this form is nondegenerate, i.e. (u, V) = 0 implies u = 0. 
(ii) Derive from (i) a new proof of Exercise 7.3.13. 


Hint. (i) Multiply u by a suitable matrix a € M;(K) in order to obtain 


tr(ua) # 0. 
(ii) The equality f(u) = >>", a;ub; = 0 for all matrices u € M;(K) is 
equivalent to tr(f()v) =0 for all u,v € M,(K). Use that 


tr(f(w)v) = Dd tr(asubse) = do tr(brvase) = tr(f*(v)u) = 0 


for all u,v € M;,(4) implies that f*(v) = 0 for all v € M,(K). 


Exercise 7.3.15 Show that any matrix with trace equal to zero in M;,(K) 
is a linear combination of commutators. 


Hint. Show this for the basis of sl, (4) consisting of e;;, i # 7, and €11 — ei, 
t= 2,...,k. 


Definition 7.3.16 Let the polynomial f(x, y1,...,Yn) € K(,y1,.--, Yn) be 
linear (i.e. homogeneous of degree 1) in the variable x. Writing f in the form 
f = dl geh;, where gj,h; © K(yi,..-,¥n), the Razmyslov transform of f is 
the polynomial 


Pan, oe 5 Yn) = So hiegi- 
For example, if 


f(@,my2) = lem tye, ye] =l-e-yye ty: @- yo-yo ey — yom a1, 


then 
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P° (©, 41, ye) = nye @ T+ yo ayy -@-yo—1-w-your = [yo, ely t+yi[ye, a]. 
Lemma of Razmyslov 7.3.17 Let the polynomial 


F(®,Y15---5 Yn) € K(8,y1,---54Yn) 


be homogeneous of first degree in « and let f*(x,41,..., Yn) be the polynomial 
obtained by the Razmyslov transform. Then: 

(i) f(@,Y1,---,Yn) ts @ polynomial identity for the matrix algebra M;,(K) 
if and only if f*(x,y1,.-.,Yn) ts a polynomial identity. 

(ii) f*(%,y1,---,Yn) is @ central polynomial for the algebra M;,(K) if 
and only if f(@,Y1,---,Yn) ts an essential weak polynomial identity and 
F([@, yo], ¥1,---5 Yn) 18 @ polynomial identity for M,(K). 


Proof. (i) Let 


m 


F(®,Y1,---5Yn) = Se ai( (Yi,--- Un yabs(yi,.--5 Yn) 


i=l 


and let, for fixed r1,...,%n € My (4K), 


vs 


f(u) = fluri,...,7n ay(T1,..-,Tn)ubs(71,..-,7n), ue Ma(K). 


i=l 


By Exercise 7.3.13, f(u) = 0 for all wu € M;,(K) if and only if f*(w) = 0 and 
this gives the statement of (i). 
(ii) Assuming again that 


m 
F(®,Y1,---5Yn) = Se ai( Yis-+ +5 Un yabs(yi,.--5 Yn) 
i=l 


we apply the Razmyslov transform to 


m 
g(®, Yo, M1; e 5 Yn) = F(lx, yol, e 3 Yn) = So ai( (Yo —_ You) )b; 
i=l 
and obtain 
m 
GF (®, YO, Ys. Yn) = S"(yobiaas — bjxa;yo) = 
i=l 


= [os bir] = = [yo F(@,¥15---5Yn)]- 


Obviously, f*(%,y1,---, Yn) is a central polynomial (maybe vanishing) for 
M;, (4) if and only if 
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I (@, Yo, Yis---5 Yn) = [yo, f° (@, Y1,---5 Yn) 


is a polynomial identity and this means that f([#, yo], y1,---,Yn) = 0 isa 
polynomial identity. Clearly f*(z,y1,..., Yn) does not vanish on M;,(K) if 
and only if f(#, y1,-.-, Yn) is not a polynomial identity. In order to finish the 
proof it is sufficient to use Exercise 7.3.15 that any matrix with trace zero is 
a linear combination of commutators which means that f(%,y1,..., Yn) = 0 
is a weak polynomial identity. 


Theorem 7.3.18 (Razmyslov [218]) Let 
f = f(e, “1, sey %2h2-2,41,-- 65 YR2-1) = 


= dyo(a, [21, 22],.-., [Zon2-3, Zon2—2]5 1, 41, ---, Ye2—1, 1) 


where dn(%1,---5%njYis---5Yn4i) is the Capelli polynomial. The Razmyslov 
transform applied to f gives a multilinear central polynomial for M),(K) over 


any field K. 


Proof. By Exercise 7.3.9, f(a, 21,..-, Y¢2-1) = 0 is a weak polynomial iden- 
tity because it vanishes if we replace x by a traceless matrix (the matrices 
[za:-1, 224] are already traceless). It is also clear that the replacement of x by 
a commutator gives rise to a polynomial identity. By Lemma of Razmyslov 
7.3.17, it is sufficient to show that f(x, z1,...,yx2-1) = 0 is not a polynomial 
identity. We choose 2k? —2 traceless matrices Z; € M;,(K) such that the set of 
commutators [79;_1, Za], = 1,...,k? — 1, coincides with the basis of sl, (K) 
consisting of all ej;, i A 7, and ey — e11, 7 #1. Replacing x by # = e1; and 
using the skew symmetry of the Capelli identity we obtain that 





f(z, 21, oe -) Z2k2-3, Z9h2-9; Y1;-- -5 Yk2-1) = +dy,2(€:5; 1, Y1ly +++) YR2-1) 1), 


where d,2(e:j; 1, 41,-.-, Ys2-1, 1) means that we have replaced the skew sym- 
metric variables by the k? different matrix units. Now, as in Exercise 7.1.2, we 
may choose matrix units y; to obtain a nonzero value of the Capelli identity. 
By Lemma of Razmyslov 7.3.17 the polynomial f* is a central polynomial 
for M,(K). 


Using the method of Razmyslov, Halpin [124] constructed another central 
polynomial. We give a simplified version of his theorem. 


Exercise 7.3.19 (Halpin) Let chark = 0, let S, act on {0,1,2,...,k-—2,k} 
and let 


1 
W1 (Hy 21,0665 Sms Yly +s YR—-1)s ma klk — 1), 


be the complete linearization of 


WL, Y1,- ++) YR-1) = S- (signe) 2? ya? yo 20 Dag ae 8), 
oeS, 
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(i) Let fp(a,Z7,U,V,Y) = 


= w1(&, 21,.-., Zp, (ur, v1],---, [Um—p, Um—p], Y1y +--+, YR—-1), P= 9O,1,...,m. 


Show that there exists a p such that the polynomial f> (x, 7,U,V,Y) obtained 
by applying the Razmyslov transform 7.3.16 to f,(#, Z,U,V,Y) is central for 
M;,(K). 

(ii) Show that the only p with the property as in (i) is p= 0. 


Hint. (i) Use Exercise 7.3.10 and its proof. See that fn(x, 7, U,V, Y) is a weak 
but not a polynomial identity. Show that fo([#,t],U,V,Y) is a polynomial 
identity. Therefore f,(z,Z,U,V,Y) satisfies the assumptions of Lemma of 
Razmyslov 7.3.17 for some p. 

(ii) Very probably there will be difficulties with this exercise. If no success, 
see the original paper by Halpin [124]. 


Exercise 7.3.20 Let chark = 0. Find the linearization w(x, z,y) of the 
weak polynomial identity w(x,y) = [x?,y] for Mo(K) (ie. the multilinear 
component of w( + z,y)). Show that w(x, [u,v], y) is not a polynomial 
identity for Mz(K) and wy1([«, ¢], [u, v], y) is. Apply the Razmyslov transform 
to wi(a, [u, v], y) and obtain a central polynomial for M2(K). 


Answer. Up to a multiplicative constant and the names of the variables, one 
obtains 
[v1, v2][x3, v4] + [w3, va][x1, 2]. 


Latyshev and Shmelkin [168] constructed a central polynomial in one 
variable for M;() when the field A is finite. 


Exercise 7.3.21 Let K = F, be a finite field and let p(x) be an irreducible 
polynomial of degree k in F,[x]. Show that the following polynomial in one 


variable 
k-1 


qg® _ (q* -1)k 
c(@) = II (at — x(t Dk (: ‘| 
mol 





p(@) 
is central for M4; (F,). 


Hint. Let a € M;,(F,). Each eigenvalue of a belongs to some extension Fyn 
of F, with m < k. Conjugating a by an invertible matrix with entries from an 
extension of IF,, we consider a in its Jordan normal form. If the eigenvalues 
of a are not zeros of p(x), then c(a) = 0. By Exercises 7.2.3 and 7.2.4, there 
exists a matrix a € M;,(F,) with eigenvalues among the zeros of p(x). This 
matrix has no multiple eigenvalues (why?) and hence its Jordan normal form 
(in M;, (Fj )) is diagonal. Hence c(a) = g? ~*(a), where g(a) is diagonalizable 


matrix with nonzero eigenvalues from F,.. Therefore c(a) = g? ~*(a) =e. 
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Considering some important algebras, as the generic matrix algebras, a 
natural question is how big is the centre of the algebra, which gives some 
rough idea how far from commutative is the algebra. Tracing the literature, 
one can see that for different purposes one needs different central polynomials. 


Problem 7.3.22 Find new central polynomials for the matrix algebras 
M,,(K), k > 2. 


The complete description of the central polynomials is known for 2 x 2 
matrices only (chark = 0), see e.g. the papers of Formanek [105] and Okhitin 
[200] for different descriptions. 

The next exercises may be useful for searching for central polynomials of 
low degree and show that over a field of positive characteristic the complete 
description of the central polynomials should be difficult even for M2(K). 


Exercise 7.3.23 Prove that every central polynomial for M;,(K) is a poly- 
nomial identity for M,_1(K). 


Hint. Embed the algebra M,_1(A’) into M;,(K) assuming that the k-th col- 
umn and row of a € M,_1(K) are zero. The evaluation of the central poly- 
nomial is a scalar k x k matrix with 0 in the &-th position of the diagonal. 


Exercise 7.3.24 Prove that every central polynomial for M@;,(K), k > 1, isa 
polynomial identity for the algebra of & x & upper triangular matrices U,(K) 
with entries from K. 


Hint. Let e(r1,...,7%n) = ae, where 0 #4 a € Ky 14,...,% © Up(K). Show 
that aeq1,...,eg—1,n~-1 depend on the first & — 1 rows and columns of 
’1,---,?%, only, and do not depend on the last k-th column. Then apply 
Exercise 7.3.23. 


Exercise 7.3.25 (i) Let f(%1, @2, #3, #4) be a multilinear polynomial of de- 
gree 4 in Fo(X). Show that f(a1, #2, £3, %4) is a central polynomial for Mz(Z2) 
(not necessarily nontrivial) if and only if it is a linear combination of 


[T4, %o(1)] © [%o(2); o(a)], 7 € $3, (2) > o(3), 
and 
c(21, #2, 43, 24) = [x4, ¥1][r2, 23] + [v4, v2] [z3, 21] + [v4, v3][21, ze]. 


Show that c(#1,#2,#3,%4) = 0 is a weak but not an ordinary polynomial 
identity for Mo(Za). 

(ii) Show that if charkK # 2 then the multilinear central polynomials of 
degree 4 for M2(K’) are linear combinations only of 


[T4, %o(1)] © [%o(2); o(a)], 7 € $3, (2) > o(3). 
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Hint. By Exercise 7.3.24 and by Theorem 5.2.1 for the polynomial identities 
of the upper triangular matrix algebras (which does not depend on the field 
if we consider multilinear identities only) a multilinear central polynomial of 
degree 4 is a linear combination 


f(#1,%2,%3,%4) = S- A [%o(1); ®o(2)|[Lo(3); Foal 


where o € S4, o(1) > o(2), (3) > o(4). Replacing x1, x2, #3, %4 with matrix 
units and assuming that the result is a scalar matrix, we obtain a homoge- 
neous system with 6 unknowns ag. It turns out that the polynomials given 
above are obtained from solutions of the system and any other solution is 
their linear combination. We shall illustrate this (with some small additional 
tricks) in the case of characteristic 2. Since we work with multilinear identi- 
ties we may assume that AK = Zo. First we write the standard polynomial s4 
in the form 


84(@1, €2, 03,4) = S lea, £o(1)] © [Lo(2); Fo(a)], 7 € $3, 0(2) > (3). 


If tr(a;) = 0, ¢ = 1,2, by the linearized version of the Cayley-Hamilton 
theorem (as in the Razmyslov proof of Amitsur-Levitzki Theorem 7.1.6) we 
obtain that a,°oa2 is central, therefore we obtain two more central polynomials 
of the form 

[t4, %o(1)] © [@o(2), €o(3)], 


e.g. for o(1) = 1 and o(1) = 2. (The third central polynomial (for ¢(1) = 3) 
is a linear combination of s4 and the other two). Now, ¢(#1, 2, #3, 4) is skew 
symmetric in #1, %2,%3. We choose a basis of Mz(Z2) 


E11, €12, €21,€ = [e12, e21] 
We need to calculate only the cases (why? — proper polynomial identities!) 
c(e11, €12, €21, €11) = €, c(ei, €12, €21, €12) = 0. 
Now we consider the six-dimensional vector space with basis 
[to(1), Fo(2)][Po(a), ®o(ay], F € Sa, (1) > (2), 0(3) > ofA), 
and choose a new basis consisting of the four found central polynomials and 
[a4, 21][e3, eo], [aa, val][ars, vi]. 

It is sufficient to show that every nontrivial linear combination 

f (1, €2,%3, 84) = a[e4, x1] [¥3, v2] + Blea, xo][v3, x1] 
cannot be a central polynomial. For example 


Fei, €12, E11; €91) = A699 
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and a = 0. Similarly, @ = 0. 


If not explicitly stated, till the end of the chapter we assume that charK = 
0. 


Problem 7.3.26 Find the minimal degree of the central polynomials for 
M,,(K), chark = 0. Find the minimal degree of the central polynomials in 
two variables. 


The central polynomials of Formanek [104] are of degree k?. The original 
polynomials of Razmyslov [218] were of degree 3k? — 1 but using other weak 
polynomial identities, Halpin [124] also reduced the degree to k?. There was 
a believe that the answer to the above problem is k? and this is the case for 
k= 1,2. 

For k = 3 Drensky and Azniv Kasparian [91, 92] proved that the minimal 
degree of the central polynomials is 8. The new central polynomial of degree 
8 was obtained using ideas of the Rosset proof of Amitsur-Levitzki Theorem 
7.1.8. The approach to show that there are no central polynomials of degree 
7 was based on computations as in Exercises 7.3.23 — 7.3.25 combined with 
techniques of representation theory of the general linear group (see Chapter 
12 for the method of representation theory of groups). 


Conjecture 7.3.27 (Formanek [108]) The minimal degree of the central 
polynomials for M;,(K) over a field K of characteristic 0 is equal to 


mindeg(M,(K)) = stk + 3k — 2). 


Although this is not a reason, the only quadratic function p(k) with p(1) = 
1, p(2) = 4 and p(3) = 8 is (hk? + 3k — 2). What is more important, the 
conjecture of Formanek agrees with some other conjectures in the theory 
of Pl-algebras. For k = 4 Drensky and Giulia Maria Piacentini Cattaneo 
[94] found a new central polynomial of degree 13. (Pay attention: $(4 + 
3.4 — 2) = 18. Unfortunately we do not know whether M4(K) has central 
polynomials of degree 12.) Their construction uses a weak polynomial identity 
of degree 9 and combines the methods of Formanek and Razmyslov. The 
result was generalized by Drensky [85] to construct central polynomials of 


degree (k — 1)? + 4 for M;,(K), k > 2. 


Problem 7.3.28 Find the minimal degree of the weak polynomial identities 
for M,(K), chark = 0. Find weak polynomial identities of low degree which 
induce central polynomials of low degree. 


The answer is known for k = 2 where the theorem of Razmyslov [217] 
shows that all weak polynomial identities are consequences from [x?, x2] = 0 
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(see Remark 7.3.11) and for & = 3, where Drensky and Tsetska Rashkova 
[95] described all weak polynomial identities of degree 6. (There are no weak 
identities of lower degree for M3(K)). These weak identities give the central 
polynomials of Halpin (of degree 9) and one of them gives a central polynomial 


of degree 8. 


7.4 Various Identities of Matrices 


Exercise 7.4.1 Show that the matrix algebra M,(K) does not satisfy poly- 
nomial identities f(a, y1,...,¥s—1) = 0 which are multilinear in y,..., ym—1. 
(Remember that the field is of characteristic 0, and hence is infinite.) 


Hint. Try to find two different solutions. (i) Replace « by a generic diagonal 
matrix and the y,’s by the matrix units e, 41. (ii) Show that f cannot be 
written as a linear combination of products containing k or more commutators 
and use that every identity for the matrix algebra is also an identity for the 
algebra of upper triangular matrices. 


Exercise 7.4.2 (Bergman [84]) Let 


_ a Gk4+1 la 
g(t, ..., Urq) = ) Agu... Upy ty, Aa EK, 


be a polynomial in commuting variables and let 
Ag) (@, 915-5 Uk) = So at ye? yew ys... .0* year 


be the polynomial in A (a, y1,..., ym), as in the Formanek construction of the 
central polynomials (Exercise 7.3.4 and Theorem 7.3.5). 

(i) Show that @(g)(#, y1,-.-, ys) = 0 is a polynomial identity for M),(K) if 
and only if the polynomial g(u1,..., x41) is divisible by all up—ug, 1< p< 
q<k+1. Show that in this case @(g)(x, y1,..., ys) follows from the identity 
of algebraicity in Exercise 2.1.8. 

(ii) Let f(@,y1,.--,yn) © Ka,y,..., yn) be multilinear in y1,..., yp. 
Write f in the form 


f(@.y, oe 5 Uk) = S/O gi) (®, Yin, oe 65 Yig)s Ji € K[ui, oe ., Ug. 


Show that f(%,y1,.--, Ym) = 0 is a polynomial identity for M; (4) if and only 
if each 0(g;)(%, ¥:,,---,Yi,) = 0 is a polynomial identity. 


Hint. (i) Replace « by a diagonal generic matrix and the y,’s with the matrix 
units 


E12, €23, +--+) Ci-1jiy ++ +) CF-1,F5 Cfis Ci,g41s EF41,7+2)---) €K-1,k) 
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see Fig. 7.2 for the corresponding oriented graph. 


Fig. 7.2. The graph corresponding to {e12, €23, €34, €45, €56, €67, €73,, €38 } 


Show that if @(g) = 0 is a polynomial identity, then all differences u, — uy 
divide g. On the other hand, if u, — uy divides g for all pairs (p,q), p # 
q, then replacing x by a diagonal matrix and the y’s by matrix units, we 
obtain 0. Show that the identity of algebraicity corresponds to the product 
g = [[ (tu — u,). Show that 


A(t g) (2, Y1,---5 YR) = O(g)(%, Y1,- ++, £Yp, +--+ Yk) = 


= A(g) (x, m1, e +> YUp-1%, e 5 Uk) 


and derive that all polynomial identities of this form follow from the identity 
of algebraicity. For (ii) try to modify the above arguments. If some difficulties 
arise see the original proof of Bergman [34] (or, if his preprint is not available 
to the reader, see the paper by Drensky and Tsetska Rashkova [95] where a 
similar result is obtained for weak polynomial identities). 


In the next several exercises we again assume that the field K is arbitrary. 


Exercise 7.4.3 Show that A (a, y) has a basis 


{aPoy® [ar, yla?ty™ [ay]. .aPP-ty™ [a ylaP y™ | pi, qi > 0,k > Of. 


Hint. Using the equality ye = xy—[«, y], we can write any element of K(x, y) 
as a linear combination of the above elements. In order to see that the el- 
ements are linearly independent, use one of the following two possibilities. 
(i) Show that any nontrivial linear combination of these elements does not 
vanish on some algebra of generic upper triangular matrices (as in Theorem 
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5.2.1). (ii) Assume for a while that these elements are linearly independent. 
Compute the Hilbert series of the vector space spanned by them. It is 


n= ae (wap) > 


k>0 


1 
= a= Hilb( K(x, y),t). 
If our elements are linearly dependent, this will give us that some of the 
coefficients of the Hilbert series of the vector space spanned by them are 
smaller than the corresponding coefficients of the Hilbert series of K(x, y). 





Exercise 7.4.4 Let 


R= K(e,y)/([e7,y], [y’, ]) 


be the factor algebra of K(a,y) modulo the ideal generated by [x?, y] and 
[y?, 2]. Show that as a vector space R has a basis 


{x%y’[x, yl° | a,b, ¢ > Of. 


Hint. We work in R using the same symbols # and y for the images of the 
free generators of K(x, y). Since 


0= [x,y] = ala, y] + [z, y]a, 0= [y’, a] = —yle, yl ~~ [z, y]y, 
we obtain that 
[x, yla = —ale, yl, [x, yly = —ylx, yl. 
Applying Exercise 7.4.3, we see that R is spanned on 
{wy?[x, yl° | a,b,c > O}. 


First, let chark # 2. The algebra generated by two “generic” traceless ma- 


trices of the form 
_f€& 0 _f{n7 ¢ 
r= (j fre? ‘i 


(where €, 7, ¢ are algebraically independent commuting variables) satisfies the 
defining relations of the algebra R (check it!). Assuming that € > 7 > ¢, show 
that the leading term of u*v?/[u, v]® is equal to 


atten Cer - en)? [er —e€99,€12 ten},,OFaEK. 


Derive from here that the elements x7y°[a, y]° are linearly independent in R. 
If chark = 2, use the generic traceless matrices 





uy = €1(€11 + €22) + mi(ere + €21), Ua = Eo(e11 + €22) + re12 
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and the ordering £ > 7, > 2 > 1. 


Remark 7.4.5 Exercise 7.4.4 shows that the factor algebra of AK (a, y) mod- 
ulo the ideal generated by [x”, y] and [y?, x] is isomorphic to the factor algebra 
of K(x, y) modulo the weak polynomial identities of Mo(K). This is a par- 
tial case of a general result concerning weak polynomial identities of 2 x 2 
matrices, see Exercise 12.6.12 and Remark 12.6.18. 


Exercise 7.4.6 Show that the following two systems of elements in K(«, y) 
are independent (i.e. are systems of free generators in subalgebras of K (a, y)). 


y, yr, yx, ye, ..., 
y[e,y)2', kL > 0. 


Hint. Show that the algebra generated by y, yx, yx7, yx®,... is spanned by 
the words yx"... yx’? and the presentation 


ya"... yate = (ya)... (ya!) 
is unique. For the second set use similar approach applying Exercise 7.4.3. 


Exercise 7.4.7 Show that every Pl-algebra satisfies a polynomial identity 
in two variables. 


Hint. If R satisfies no polynomial identity in two variables, then 
Kia, y)AT(R) =0, Fo(R) = K(a,y)/(K (a, y) NT(R)) = K (2, y) 


and by Exercise 7.4.6, F(R) contains a free associative subalgebra. Hence 
both Fo(R) and R satisfy no polynomial identities at all. 


Exercise 7.4.8. Give a new proof of Exercise 7.1.13. Show that the polyno- 
mials 


sor([z, y], la, yl, [z, ye. ola, yl, yle, yle, [, yle”, y* [ay], y?[2, gle, -..) 


are nonzero in A(x, y) and, for k sufficiently large, are of degree less than the 
degree of the identity of algebraicity for MM, (4K). 


Hint. Use Exercise 7.4.6 and the fact that 
deg sox ([x, y), yla, y), [a, ye, e ) = O(kVk) 
and the degree of the identity of algebraicity is O(k?). 


We complete this section with short discussions on other T-ideals in some 
sense similar to the matrix T-ideals. 
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Remark 7.4.9 Over a field of characteristic 0, Latyshev [166] called a T- 
ideal T(R) stable if the set of its multilinear polynomials is stable under 
the Razmyslov transform 7.3.16. He started the systematic study of stable 
T-ideals. Further development was given by Okhitin [199], who proved that 
the stable T-ideals have many properties similar to those of the T-ideals of 
polynomial identities of matrices. In particular, he established that if T(R) 
is stable, then R has central polynomials. 


8. Multilinear Polynomial Identities 


The starting point of our considerations was the claim that the class of PI- 
algebras is reasonably big and enjoys the most important properties of com- 
mutative and of finite dimensional algebras. One of the goals of this and the 
next chapters is to give a quantitative confirmation of this statement. 

In this chapter we consider multilinear polynomial identities. Over a field 
of positive characteristic not all polynomial identities of an algebra follow 
from the multilinear ones. Nevertheless we shall see that the multilinear iden- 
tities carry a lot of information about all identities. Up till now, most of our 
considerations have involved computing with the identities of concrete im- 
portant algebras. Now we go from one extreme to the other and assume that 
the only information we have is that the algebra satisfies some polynomial 
identity. The main quantitative result of the chapter is the theorem of Regev 
for the exponential growth of the codimension sequence of a PlI-algebra. As 
a consequence we give another important theorem of Regev that the tensor 
product of two Pl-algebras is again PI. This is one more confirmation that 
the class of PI-algebras is nice: it is closed with respect to natural algebraic 
operations. Then, till the end of the section we consider algebras over a field 
of characteristic 0. First we study the Pl-algebras with polynomial growth 
of the codimension sequence and give some description in the unitary case. 
Then we prove one of the corner stones of the PlI-theory and its applica- 
tions: the Nagata-Higman theorem that a nonunitary algebra which is nil of 
bounded index, is nilpotent. Finally, we give some (very slight) idea about 
the structure theory of T-ideals developed by Kemer and prove his theorem 
that the standard identity implies the Capelli identity. 


8.1 The Codimension Theorem of Regev 


In this section we assume that the base field K is arbitrary. Although we 
still have our convention to consider unitary algebras, the main results hold 
without any changes also in the nonunitary case. First we prove the theorem 
of Regev [222] for the exponential growth of the codimension sequence of 
a Pl-algebra, 1.e. for every Pl-algebra R there exists an a > 0 such that 
én(R) = O(a”). (Compare the asymptotic behaviour of c,(R) with this of 
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dimP,, = n! which grows much faster than any exponential function.) We give 
the proof of Latyshev [163] based on the Dilworth theorem in combinatorics 
[61]. The version of the Dilworth theorem used in our exposition is proposed 
by Amitsur. This approach to the proof of the theorem of Regev is considered 
to be standard nowadays. 


Definition 8.1.1 A set P with a relation < (notation (P,~<)) is partially 
ordered if the relation is transitive, i.e. a ~ 6 and 6 ~ ¢ for a,b,c € P implies 
a < c and if the relations a ~ 6 and 6 ~ a cannot be satisfied simultaneously 
for any a,b € P. The elements a,,...,a, form a chain in P if ay ~ ... ~ ag; 
they form an antechain if the relation a; < a; does not hold for any pair 
(a;,a;) with different i and j. 


Below we give without proof the original version of the Dilworth theorem. 


Theorem 8.1.2 (Dilworth [61]) Let (P,~<) be a partially ordered set. Then 
the number of chains of P into which P can be partitioned (1.e. presented as a 
disjoint union) ts equal to the maximum number of elements in an antichain 


of P. 


One idea for the following definition comes from the Shirshov approach 
to combinatorics of words and its applications to Pl-algebras. (Compare Def- 
inition 8.1.3 with Definition 9.1.5.) 


Definition 8.1.3 For a permutation 7 in S, we denote by d(z) the largest 
number d for which there exist integers 1 < t1 < tg <1... < tg <n such 
that 7(%) > w(i2) > ... > w(ia). For a fixed integer d > 1, the permutation 
m in Sy is called d-good if d() < d. In other words, for a fixed 7 € S,, we 
introduce a partial ordering on the set P = {1,2,...,n} in the following way: 
i<j if and only if i < j and a(2) < 7(j). Then d(x) is the maximum length 
of the antichains in {1,2,...,n} and z is d-good if there are no antichains of 


length d. 


For example, let n = 6 and 


_f1 2 3 4 5 6 
"\5 32 4 1 6)° 
Then d(z) = 4, since 1 < 2 < 3 < 5 and a(1) > 2(2) > 7(8) > 2(5) (and 
there are no antichains of length 5). Hence m is 5-good (and also 6-good). 


Definition 8.1.4 For a permutation 7 € S, we construct a pair of tables 
Ti (m7) = (tj;) and To(7) = (uj) in the following way. We set #1), = 1, wi = 
a(1). By induction, if t); exists, it is the smallest & such that t1;-1<k<n 
and m(k) > uij;-1. Then we set uj; = 7(k). When we cannot find the next 
t1; we start with the second rows of Ti(a) and T(z): to: is the smallest 
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k € {1,2,...,2} which does not belong to the first row of T, (7), wai = m(t21) 
and, if 7 > 1, then ty; is the smallest k not in the first row of T)(a) such that 
toj-1 < k <n and a(k) > wo j;-1; we set wo; = 7(k). When we finish the 
second row, we continue with the third, etc. 


For example, let n = 6 and 
rH 12 3 4 5 
~\4 3 5 1 2 ‘ 


We give the consecutive steps for constructing T)(7) and T (7): 


loner) 





tit 1, Ul11 m(ti1) 4: 





1. T, = (1), % = (4); 


m(2)=3<4=u1, 7(3) =5>4= uy, hence tyg = 3, wig = 5: 


2. T, = (13), Ts = (45); 

m4) =1<5= we, 7(5) = 2 <5 = we, 7(6) = 6 > 5 = ua, hence tis = 6, 
ui3 = 7(6) = 6: 

3. T, = (136), T) = (456). 


We cannot continue the process. The integers left in a are 
_f* 2 * 4 5 x 
"\e Boe 12 * 


and we start with the second rows of T\(m) and To(7): 
toy 2, U21 m(to1) 3: 


1 3 6 4 5 6 
4. n= (4 ) n= (4 ). 


Since 7(4) = 1 < 3 = ua1, 7(5) = 2 < 3 = tg), the process stops again and, 
considering 
_f{* * * 4 5 x 
7 (: * * 1 2 ') , 


we continue with the third rows. We set ts; = 4, w3i = m(t31) = L: 














3. 6 5 6 


1 4 
5. Ti =| 2 Te= [3 
4 1 


Since 7(5) = 2 > 1 = ug, we complete the third rows of the tables Ti (m) and 
To(1) by t32 5, U32 n(t32) 2: 
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3. 6 5 6 


1 4 
6. T,-|2 I= 13 
4 5 1 2 
Lemma 8.1.5 (Amitsur, see [231]) For a permutation 7 € S,,, the integer 


d(x) is equal to the number of the rows d of the table T,(z). 


Proof. By the construction of the tables T\(m) and T2(7), if t1 <...< tain) 
and m(i1) >... > t(tacry), then %1,...,%a(r) are in different rows of 7) (7). 
Hence d(z) < d. Now we shall construct a sequence 4) < ... < ég with 
m(t1) >... > m(tg). We start with 7g = tq and, by induction, if ¢,41 is in the 
k+L-st row of T, (a), we define i, = t,; for the largest j such that th; < de41. 
If uxgj < W(te41), then t%41 should be in the k-th row of T; (a), which is not 
true. Therefore, 
T(tk) = m(tej) = Ung > T(te41) 


and we can continue the process. This gives that d(m) = d. 


Theorem 8.1.6 (Latyshev [163]) [f R is a Pl-algebra and the T-ideal T(R) 
contains a polynomial identity of degree d, then the vector space of multilinear 
polynomials of degree n in K(X) is spanned modulo T(R) by the monomials 
Ur(1)++-€x(n) where t ES, is d-good (i.e. d(m) < d). 


Proof. Since T(R) contains a polynomial identity of degree d, it contains 
also a multilinear polynomial of degree d. Without loss of generality we may 
assume (why?) that R satisfies an identity of the form 


LqUq-1...U, = S- AgXg(1)2o(2)---La(d), We EK, 
oeSa 
and the summation is over all permutations o different from 6 € Sg, where 
_ 1 2 ... d-1 d 
~\d d-1.... 2 1} - 
We shall work in P,(R) = Ph/(PnOT(R)), ie. in the vector space of mul- 
tilinear polynomials of degree n modulo the polynomial identities of R. Let 
Gq be the set of all monomials %_(1)...@z(n) € Pn(R) such that the permu- 


tation 7 € S, is d-good. We shall show that P,,(R) = span(Ga). We order 
lexicographically the monomials in P,, (2) assuming that 


®e(1)+++oe(n) < Berl) +++2r(n) 
if and only if for some k 


o(1) =7(1),...,0(k) = 7r(k),o(k +1) < r(k 41). 
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Let h = &(1)---€n(n) be the minimal monomial which is not in span(Gq). 
Hence d(z) > d and there exist 4) < ... < tg with m(#,) > ... > (ta). We 
write A in the form 


h = hota, hita(igyhe-.-Ra-1¥ a (ig) ha; 
apply the polynomial identity of degree d for 
Eq = Uq(j,)h1,Ld-1 = La(ig)ha,.-., 2 = La(ig_y)hd-1, %1 = Ua (igyha 
and obtain 


h=ho- (€a€a-1 .. #1) = S- Agho®g(1)®o(2) .. -£o(d) = 
oeSa 


= Do ac horn tiga) hold) @a tocar) Ro(d=1) ++ Baliga) Molt): 
oeSa 
Since m(i,) >... > (ig) and the summation is on o # 4, we obtain that all 
monomials in the latter sum are below than A in the lexicographic ordering 
and, by inductive arguments, belong to the vector subspace span(G4a) spanned 
by the set Gg corresponding to the d-good permutations. Hence fh also belongs 
to span(Gq). This completes the proof of the theorem. 


Now we are ready to prove the theorem of Regev for the exponential 
growth of the codimensions of a PlI-algebra R with the estimate obtained by 
Latyshev. 


Theorem 8.1.7 (Regev [222]) Let R be a Pl-algebra with a polynomial iden- 
tity of degree d. Then the sequence of the (multilinear) codimensions of the 
polynomial identities of R satisfies 


en(R) < (d—1)*", n=0,1,2,... 


Proof. By Theorem 8.1.6, it is sufficient to show that the number of the d- 
good permutations in S,, is bounded by (d—1)?". By Lemma 8.1.5, for any d- 
good permutation 7, the tables T; (a) = (t;;) and To(7) = (ui;) constructed in 
Definition 8.1.4, have less than d rows. Since m(tj;) = wz, every permutation 
m is uniquely determined by the pair (T\(m), 72(7)). In each row of T, (a) and 
To(m) the integers ¢;1, tj2,... and uj;1, wa, ... increase. Each integer 1,2,...,” 
can be written in one of the d—1 rows of T; (7) and in one of the d—1 rows of 
To(m) (maybe not all pairs of tables correspond to substitutions). Hence the 
number of the pairs of tables with less than d rows is bounded by (d—1)?”. 
This completes the proof of the theorem. 


Exercise 8.1.8 Prove directly, using Theorem of Dilworth 8.1.2, that the 
number of the d-good permutations in S,, is bounded by (d— 1)?”. 
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Hint. Let 7 be d-good. Consider the partial ordering in {1,2,...,} induced 
by z. Then the maximum length of the antichains in 7 is < d—1 and by 
the Dilworth theorem, the set {1,2,...,n} can be partitioned in < d— 1 
chains, say Ci,...,Cp, p < d— 1. Every chain corresponds to a sequence of 
pairs (#1, 7(t1)),..., (én, t(tm)) with i) <... < i, and m(t1) <... < (tg). 
Define functions t : {1,2,...,n} > {1,...,d—1} and wu: {1,2,...,n} 3 
{1,...,@— 1} by the rule t(@) = j and u(a(é)) = 7 if (¢,7(%)) is in the j- 
th chain C;. Show that different d-good permutations correspond to different 
pairs of functions (¢, u) and the total number of pairs of functions is (d—1)?”. 


Exercise 8.1.9 Show that for any Pl-algebra R, the codimension series of R 
c(R,t) = So en(R)t” 
n>o 
has a positive radius of convergence. 


Hint. Use that if R satisfies a polynomial identity of degree d, then the 
codimension series is majored by 


2n4n 1 
So (d-1)""t = Tay 


n>o 


It is remarkable, that using purely quantitative approach, Regev [222] 
succeeded to prove a purely qualitative result and to answer in the affirmative 
the problem of Kaplansky whether the tensor product of two Pl-algebras is 
again PI. 


Theorem 8.1.10 (Regev [222]) The tensor product R= R, @xK Ro of two 
Pl-algebras R, and Ry is also a PI-algebra. 


Proof. Let Ry and Re satisfy, respectively, polynomial identities of degree d, 
and dy. Therefore, by Theorem of Regev 8.1.7, 


en(Ri) < (di — 1)", en(R2) < (de — 1)". 


We choose n such that c,(Ri)en(Re) < n!. This is always possible because 
” is smaller than n! for large n. Let 


{gi(vi,..-,@n) | t= 1,2,...,ce° = en(Ri)}, 


{hj(w1,---,@n) |g = 1,2,...,c7 = en(Re)} 


be bases respectively of P,,(1) and P, (Rez). For every permutation 7 € Sp, 
we consider @_(1).-.€z(n) as an element of P,(R1) and write it as a linear 
combination 


a 
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Un(1) +++ &a(n) = S > Prigi(ar, . .5&n); Bri EK. 
i=l 


Similarly, in P,(R2), 


Eq(1)++-La(n) = So yng hy (es, -25)8n), Yny EK. 
j=l 


Pay attention, that the above two equations are polynomial identities re- 
spectively for R, and Re, i.e. they are automatically satisfied for any 
Uj,.--,Un € Ry, and w1,...,U, € Re, respectively. We look for a multilin- 
ear polynomial identity of degree n for the tensor product R = R, @ Re of 
the K-algebras R, and Ry. Let 


f(®1,---52n) = S- bn@n(1)-+-€x(n) = 0 
TES y 


be the desired polynomial identity for R, where the €,’s are unknown coef- 
ficients from K. Since f = 0 is multilinear, it is sufficient to show that it 
vanishes for arbitrary 


Uy © U1, Ug © Va,...,Un © Un, U1,---,Un © Ry, v1,...-,Un € Reo. 


We calculate f(uy © v1,..., tn © Vp) and obtain 


f(u1 @ U1,---,Un @ Un) = S- Ex (Un(1) © Va(1)) +++ (Ur(n) @ Va(n)) = 
TES y 


= S- Ex (Ur(1) «++ Ur(n)) ® (Vet) ---Ur(n)) = 


TES y 
= S- SOS. bn Bring gi(ur, 62) Un) Ag(U1,..-, Un) = 0. 


TES, t=1 j=l 


We rewrite this equation in the form 


wy (x anit) gi(t1,.-.,Un)Ag(U1,..., Un) = 0. 


t=1 j=1 TES y 


Let us consider the linear homogeneous system 


S > Pritnjér =0,0=1,...,¢,7=1,...,€". 
TES y 


The number of the unknowns €, is n! and the number of the equations is 


ec” = en(Ri)en(Re) < nl. 
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Hence the system has a nonzero solution €,, 7 € S,. Clearly, any nontrivial 
solution of the system corresponds to a nontrivial polynomial identity for 


R= R,@ Ro. 


Additional information about the polynomial identities of the tensor prod- 
uct of two Pl-algebras over a field of characteristic 0 can be found in the paper 
of Regev [225]. For generalizations in positive characteristic see the paper of 
Di Vincenzo [62]. 


8.2 Algebras with Polynomial Growth of Codimensions 


Till the end of the chapter, if not explicitly stated, we assume that the field 
K is of characteristic 0. 

Kemer [138] described the T-ideals with polynomial growth of the codi- 
mension sequence, i.e. the T-ideals T(R) such that ep(R) = O(n*) for some 
k. The result is given in the language of representation theory of the sym- 
metric group discussed in Chapter 12. Another description also due to Kemer 
[139] is given below. In particular, one can see from there that the growth of 
the codimensions of a PI-algebra is either polynomial or exponential. 


Theorem 8.2.1 (Kemer [139]) A Pl-algebra R has a polynomial growth of 
the codimension sequence if and only if R satisfies polynomial identities f, = 
0 and fz = 0 such that f; = 0 is not an identity for the Grassmann algebra 
and fo = 0 is not an identity for the algebra of 2x2 upper triangular matrices. 


For unitary algebras the description of the T-ideals with polynomial 
growth of the proper codimensions y,(2) was given by Nadzharyan [187]. 
For Lie algebras the T-ideals with polynomial growth of the codimensions 
were characterized by Benediktovich and Zalesskiy [28]. 


Exercise 8.2.2 Show that if a Plalgebra R has a polynomial growth of 
the codimension sequence, then R satisfies polynomial identities f; = 0 and 
fe = 0 such that f; = 0 is not an identity for the Grassmann algebra and 
feo = 0 is not an identity for the algebra of 2 x 2 upper triangular matrices. 
If, additionally, R is unitary, then R satisfies the standard identity so, = 0 
and the Engel identity [y,z,...,#] = yad! (2) = 0 for some k and I. 


Hint. Use the results of Chapter 5. If all polynomial identities of R hold 
for the Grassmann algebra FE, then e,(R) > en(£Z) = 2"71, n = 1,2,..5 
similarly for the algebra of upper triangular matrices. Hence the growth of 
the codimension sequence of the identities of R would be exponential. 

Now, let R be unitary. Then R satisfies a proper multilinear polynomial 
identity which does not hold for the 2 x 2 upper triangular matrices. This 
identity is of the form 
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n 
f(@1,...,8n) = So ailti,a1,22,...,8i,-..,tn] t..., aj, EK, 

t=2 
where #; means that x; 1s missing, some of the a;,’s are different from 0 and 
. is for the summands which are products of more than one commutator. 
Replace z; = y and the other x,;’s by x. Similarly, if a proper multilinear 
polynomial identity does not hold for the Grassmann algebra, it is equivalent 

to 

f(e., sey Lon) = [ay, £9] wee [fon—1, LIn + 


+ S- Qj oLo(1) +++ €o(ig) x 
x [€o(io+1) oe »@o(i1)> vo(i,+1) tee Volis)> Uo(igtl1) oe »®o(is) Vo(igt+1) oe »&o(2n)- 
Show that 





S- (signm)[#x(1) oe -La(jx)> Un(ji+1) oe »&ax(jo)> Un (jot yee &x(k)] = 0. 
TES, 


Exercise 8.2.3 Let R be a unitary algebra such that T(R) contains Mo, 
for some k > 1 (where I, is the vector space of all proper multilinear 
polynomials of degree 2k in K(X)). Show that T(R) contains IF, for all 
n > 2k. 


Hint. Let u = [o(1),---,®o(p)]---[---,®o(n)], 7 © Sn, n > 2k, be a product 
of commutators. If some of the commutators is of length larger than 2, e.g. 
the first commutator is of length p > 2, then it is a consequence of 


v = [t1,...,€p-i].--[--.,2n-1] € In-1 C T(R), 


where the length of the first commutator of v is p— 1 and the other commu- 
tators are of the same length as in u. If all commutators of u are of length 2, 
then u follows from 


[ay, r9| _ .[@on-1, Lop] E Top Cc T(R). 


Exercise 8.2.4 Show that if the codimension sequence of the unitary PI- 
algebra R is not of polynomial growth, then 


en(R) > 27! n=1,2,... 


Hint. If yox(R) = 0 for some k, then y,(R) = 0 for all p > 2k and, by 
Theorem 4.3.12 (ii), cn(R) is of polynomial growth. If yo,(R) 4 0 for all k, 
then yn(R) > yn(E) and, hence en(R) > en(#) = 2771. 


Exercise 8.2.5 (Drensky [80]) Show that, if the codimension sequence of the 
unitary algebra R is polynomially bounded, then for some k € NU {0}, 
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en(R) = agn® + O(n*-!), 04 ay €Q. 


Hint. If ¢n(R) is polynomially bounded then, by Exercise 8.2.4, yp(R) = 0 
for p sufficiently large. Let k be the greatest integer such that y,(R) 4 0. By 
Theorem 4.3.12 (ii), ¢,(R) is a polynomial of degree & in Q[n]. 


Exercise 8.2.6 (Specht [245]) Prove that the dimension of the vector space 
I, of the proper multilinear polynomials of degree n in the free associative 
algebra is 





lo. wl 
sn = dim, = nt (1 nto ate t(-) ). 


n! 


Hint. Apply Theorem 4.3.12 (ii). Use the formula ¢(t) = e'¥(¢) for the ex- 
ponential generating functions (which holds not only for relatively free, but 
also for free associative algebras), where c, = n! and é(f) =14+¢+t?+... 
Hence 

A(t) =e 14+t+eP+P4+...). 


Decompose e~* into a series and multiply it by 1+¢+@4+.... 


Exercise 8.2.7 (Drensky and Regev [96]) Show that in Exercise 8.2.5 


Lo otoi, . 1 
an <1 To! ot ate t(D" p, 
1 


which, for & sufficiently large, is close to e~*. 





Hint. Apply Exercises 8.2.5 and 8.2.6 and use that a, < 4p. 


The next exercise gives a construction which is a special case of the wreath 
product of Lie algebras introduced by Shmelkin [240]. 


Exercise 8.2.8 Let U = U(F(z2)) be the universal enveloping algebra of 
the free nilpotent of class two Lie algebra / = F(z) of infinite rank. Let 
M be the free right U-module generated by one element a. Make the direct 
sum G= M 6 F a Lie algebra assuming that 


[au + fi, au2t fo] = a(urfe — uofi) + [fi, fel, wi, ue EU, fi, fe EF 
Show that G belongs to the variety AN» defined by the polynomial identity 


[[@1, ©2, x3}, [w4, U5, rel] = 0. 


Hint. Recall that F is spanned by x; and [x,;, #,] and satisfies the relations 
[x;,x;,,] = 0. Check the defining identity of 292 on the basis elements of 
G. 


8.2 Algebras with Polynomial Growth of Codimensions 117 
The following exercise presents a modification of an example of Volichenko 
[263] which shows one more difference between the behaviour of the codimen- 


sions of associative and Lie algebras. 


Exercise 8.2.9 Show that the codimension sequence of the variety of Lie 
algebras 2(9t2 grows faster than any exponential function. 


Hint. Consider the polynomials in the free Lie algebra 


[[Zon $1, €In4+2, Cans 3], [Bn t1; ®o(1)]; [fn42, ®o(2)], sey [@on, ®o(n)l], gE Sn. 





Find linearly independent images of these polynomials in the algebra G from 
Exercise 8.2.8. The number of the polynomials is n! and con43(WIte) > ni. 


Remark 8.2.10 The variety of Lie algebras 29%. has another remarkable 
property discovered also by Volichenko [264]. Its codimension sequence grows 
faster than any exponential function but any of its proper subvarieties has 
an exponential growth of the codimension sequence (char = 0). 


Explicit formulas for the codimensions of a given algebra R are known in 
few cases only. As we have seen, a lot of information about the polynomial 
identities of an algebra can be obtained from the asymptotic behaviour of 
the codimension sequence. The main results here are due to Regev who did 
a lot of work in this direction. In particular, Regev determined the exact 
asymptotics of the codimensions of the & x k matrix algebras over a field of 
characteristic 0 (see his survey article [226]). The approach of Regev is based 
on representation theory of the symmetric group, invariant theory of matri- 
ces, combinatorics and analytic methods, for example evaluating of multiple 
integrals. 

As a first approximation of the asymptotic behaviour of a given sequence 
of positive numbers ao, 41, @2,... one may consider the radius of convergence 
r(a) of the generating function a(t) = S749 @nt”. Since 


(«) = ——__— 
r(a) = = , 
lim sup, 465 %/@n 
we can measure the growth of ao, a1,@2,... by limsup,_,., %/an. There is 


a conjecture that limsup, _,,, %/¢n(A) is integer for any Pl-algebra R. This 
is known for many important Pl-algebras. Recently Giambruno and Zaicev 
[116] have established that limsup,_,., %/¢n() is integer for any finitely 
generated Pl-algebra R. Their proof uses the theory of Kemer (see Section 
8.4) combined with the asymptotic methods of Regev. 
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8.3 The Nagata-Higman Theorem 


In this section we shall prove the classical Nagata-Higman theorem for the 
nilpotency of nil algebras of bounded index. It was first proved in 1953 by 
Nagata [188] over a field of characteristic 0 and then in 1956 by Higman 
[127] in the general setup. Much later it was discovered that this theorem 
was first established in 1943 by Dubnov and Ivanov [99] but their paper was 
overlooked by the mathematical community. It turns out that this theorem 
has many applications not only in the theory of Pl-algebras but also in (both 
commutative and noncommutative) invariant theory and structure theory of 
rings. We refer to the survey article [107] or the book [108] by Formanek for 
relations with invariant theory. Of course, since we consider nil algebras, we 
assume that the algebras are nonunitary. 


Exercise 8.3.1 Show that, if chark 4 2, then the identity 2? = 0 implies 
the identity ,;%2%3 = 0. 


Hint. There are several possible solutions. Try the following. Linearizing 2? = 
0 we obtain 
e(@1, 22) = 21@2 + £221. 


Consider the identities 


€(x1 82, £3) = (21 @2)a3 Tr #3(%1 22) =0, 





€(€9a3,€1) = (€oa3)a1 + £1 (t2%3) = 0, 
€(3%1, €2) = (#3%1)€2 + to(a3%1) = 0 
as a linear homogeneous system with unknowns 22923, £2%3%, and £32129. 


Show that the determinant of this system is different from 0, hence the only 
solution of the system is 


U1 C93 = COUZBU, = U3X1, LQ = 0. 


Theorem 8.3.2 (Dubnov-Ivanov-Nagata-Higman [99, 188, 127]) Let R be 
a nonunitary associative algebra over a field K of characteristic 0 and let 
R satisfy the polynomial identity x* = 0. There exists an integer d = d(k) 
depending on k only such that R is nilpotent of class d, i.e. R satisfies the 
polynomial identity x, ...%q = 0. 


Proof. We shall work modulo the identity «* = 0, i.e. in the relatively free al- 
gebra F (91) of the variety of nonunitary algebras N defined by the polynomial 
identity «* = 0. The partial linearization of «* = 0 is 


flay) = akty + at ye +... aya? + yar} = 0. 
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Simple algebraic manipulations give 





f(x, y2t)z* j LL yk Lygk Ti gk Zayed gh IN 


j kD kj jkR-L k-j-1 
ti... ayst ati" IO 8 4 yet ai gh I" = 0), 


k-1 k-2 
So f(a, yet)! = kako lygk-h So aly f(z,a8 77!) = 0. 
j=o i=0 


Since f = 0, we obtain 
ah lykol — 0. 


1 


By induction, the identity «*-! = 0 implies nilpotency. Hence, for some 


d= d(k—1) 


k-1 k-1 
L1...Ug= ) ab; Cj, Vaya... @2d41 = ) UjU; Wy, 
J 


a 


where b;,v; € F(M) and a;,¢;, uj, w; are constants in K or polynomials in 


F(9t). Therefore 


k-1 k-1 
fy... aldgiLd42---Cedp1 = ) a;(b; (ci@ap1Uj)Y; jw; = 0 
i) 


and the algebra F'(91) is nilpotent. 


Exercise 8.3.3 Show that in the Nagata-Higman theorem the class of nilpo- 
tency d(k) is bounded by 2* — 1. 


Hint. For k = 1 one obviously has d(1) = 1. See that in the proof of the 
Nagata-Higman theorem given above d(k) < 2d(k — 1) +1. 


Exercise 8.3.4 Show that the proof of the Nagata-Higman theorem works 
also when chark = p>k. 


Hint. Since the field has more than k elements, we can apply Vandermonde 
arguments and obtain that the partial linearization f(x,y) = 0 is a conse- 
quence of 2* = 0. Since chark = p > k, the identity ka*-'yz*-' = 0 implies 
ak—lyek-l — 0, 


Exercise 8.3.5 Show that the condition chark = p > k in Exercise 8.3.4 is 
essential. Construct an algebra which satisfies the identity x? = 0 and is not 
nilpotent. 


Hint. Consider the polynomial algebra F,[X] in infinitely many variables 
modulo the ideal generated by all «?, 2; € X. Show that the (nonunitary) 
subalgebra generated by X satisfies «? = 0 but is not nilpotent. 
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For many purposes it is important to know the exact value of the class 
of nilpotency d = d(k) in the Nagata-Higman theorem. The upper bound 
given in the proof of Higman [127] is d(k) < 2* — 1 (see Exercise 8.3.3). 
The best known upper bound is due to Razmyslov [219]. Using his method 
to study polynomial identities of algebras (see his book [221]), Razmyslov 
proved that the polynomial identities of M,(K) follow from «* = 0 and 
obtained the bound d(k) < k?. On the other hand, Kuzmin [160] showed 
that there exists an algebra which is nil of index k and does not satisfy the 
identity x1 ...2%m = 0, where m= sk(k +1)—1. In this way 


k(k+1 

wee < d(k) < k?. 
Problem 8.3.6 Find the exact value d(k) of the class of nilpotency of nil 
algebras of index k over a field of characteristic 0. 
Conjecture 8.3.7 (Kuzmin [160]) The exact value d(k) of the class of nilpo- 


tency of nil algebras of index k over a field of characteristic 0 is 


a) —MEED, 


The only values of d(k) are known for & < 4: For & < 3 they were obtained 
by Dubnov [98] in 1935: 
d(1) = 1, d(2) = 38, d(3) = 6, 


and this agrees with the conjecture of Kuzmin. Very recently, in 1993, 
Vaughan-Lee [261] proved that 


d(A) = 10, 


confirming the conjecture of Kuzmin also for k = 4. 

We complete this section with an example of Kemer showing that in 
positive characteristic all polynomial identities of an algebra may not depend 
“too much” on the multilinear polynomial identities. 


Exercise 8.3.8 (Kemer [143]) Let chark = p > 0 and let R be a Pl-algebra 
with a basis as a vector space {r; | 7 © J}. Consider the algebra 


CH= KG; lie 07 =1,2,..1/(G; 6 47 =1,2,..), 


i.e. the polynomial algebra modulo the ideal generated by all squares of the 
variables. Let S be the nonunitary subalgebra of C @x R generated by 
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Show that the algebras R and S satisfy the same multilinear polynomial 
identities and every element of S is nil. 


Hint. A multilinear polynomial identity f(a1,...,¢%,) = 0 holds for R if 
and only if f(7,,...,7:,,) = 0 for all basis elements r;,,...,7r;, (including 
repetitions) and this is if and only if 


F(&11 @ri,,- 2) E5an @r5,) = 0. 


8.4 The Theory of Kemer 


It is well known that the Jacobson radical of finite dimensional algebras 
and finitely generated commutative algebras is nilpotent. The theorem of 
Razmyslov-Kemer-Braun states that the same holds for finitely generated PI- 
algebras. (Again Pl-algebras enjoy a nice property as nilpotency of the radi- 
cal.) Razmyslov [220] proved that over a field of characteristic 0 the Capelli 
identity implies the nilpotency of the radical. Kemer [140] established that 
every finitely generated Pl-algebra satisfies some Capelli identity and in this 
way the result was obtained over a field of characteristic 0. Finally, Braun 
[38] proved the theorem in the maximally general case of finitely generated 
Pl-algebras over a commutative noetherian ring A. Here we prove another 
theorem of Kemer [137] which states that the standard identity implies the 
Capelli identity. The proof of the theorem gives also some (very slight) idea 
about the methods developed by Kemer in his structure theory of T-ideals 
discussed in the end of this chapter. We divide the proof in several exer- 
cises with detailed hints, but leaving also some work to the reader. Below we 
assume that chark = 0. 

Let X UY = {a1,%9,...,41, yo,...} and let A(X UY) be the free asso- 
ciative unitary algebra freely generated by X UY. We consider the canonical 
homomorphism 


vy: K(X UY) 9S K(X UY)/(awa; + ejwe; |w © K(Y),i,7 = 1,2,..). 


We denote v(a;) = e; and v(y;) = yj, 7 = 1,2,... Hence the algebra A = 
v(K(X UY)) is generated by e),e2,... and y, yo,..., satisfies the relations 


e;we; = —e;we;, i,j =1,2,...,.weE K{Y), 
and may be considered as a generalization of the Grassmann algebra. 
Exercise 8.4.1 Show that the algebra A has a basis as a vector space 
WEF, WI2E{, WZ. Wplj, Wpt1, 1 < tg <<... < dp, 


where w1,..., Wp41 are monomials in K(Y), p=0,1,2,... 
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Hint. Use arguments as in the proof of Poincaré-Birkhoff-Witt Theorem 
1.3.2. (Compare with Exercise 1.3.11.) Pay special attention to the analogue 
of Lemma 1.3.3. Usually the uniqueness of the reduced form is based on the 
so called Diamond Lemma. See the paper by Bergman [33] for the Diamond 
Lemma and its applications in ring theory. 


Exercise 8.4.2 Let 


dy (x1, se ey ny Y1y-- sy Yn+1) = S- (signa) yi ®o(1)Y2 on ‘Yn®a(n\Yn+1 
TES y 


be the Capelli identity in n skew symmetric variables and let 
Dy = span{dy(%1,...,8njh1,...,hngi) | ai Ee KY}, 


Gy, = span{hyeyhge2.. Aner hag | hi © K(Y)}. 
Show that the map ¢,, defined by 


on : dy (x1, see Bs Ay, sey An41) —_ hyey hoe. . Anenhnsqi 
induces an isomorphism of vector spaces ¢, : Dn —> Gn. 
Hint. Show that D, and G,, have bases respectively 
{dn(%1,...,8nj W1,---,Wngi)}, {Wie2w2ee ... Wren Wn4i} 


where wy,...,Wpr41 run on the set of all monomials in K(Y). 


Exercise 8.4.3 Let U be a T-ideal of K(X UY) and let U(A) = v(U) be its 
image in A. Show that 


On(U AN Dn) = U(A)NGy. 


Hint. If 


Fler...) ni Mts Ym) = >> andn(a1,---.@ng hay... hn41) EUND,, 
h 


a, €K, hy © K(Y), then v(f) € U(A) and 


Uf) = Pers. ni Mts. Ym) = Y- andn(er,---5€njhay---s Rng) = 
h 


=n! S“anhyerhze2...hnénhingi = nlon(f) € Gn- 
h 


If v(f) Ee U(A) NG), then 


S- (signe) f(eo(1); +++) €o(n), Y1,-- + Ym) = 


TES y 
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= dn ( S- (signa) f(%o(1), teey Lo(n)s Yly sees 7) } 


TES y 


S- (signa) f(@e(1)) +++ ®o(n)s Yio ++ +1 Ym) CUN Dn. 
TES y 


Exercise 8.4.4 Let U be a T-ideal of K(X UY). Show that: 

(i) The standard polynomial s,(#1,...,#,) belongs to U if and only if the 
product e,...e, belongs to U(A). 

(ii) The Capelli polynomial d,(21,...,@n391,---,Yn41) belongs to U if 
and only if Ae; AegA... Ae, A is a subset of U(A). 


Hint. Use Exercise 8.4.3 and the equations 
Sh(€1,---,€k) = kley...€n, 


dn(€1,---5€nj W1,-- +, Wngi) = nlwye wees... Wren Wn4i- 


Theorem 8.4.5 (Kemer [137]) Over a field of characteristic 0 the standard 
identity 

Sh (1,...,%) =0 
implies some Capelli identity 


dy (x1, se ey ny Y1y-- sy Yn+1) = S- (signa) yi ®o(1)Y2 on ‘Yn®a(n\Yn+1 =0. 
TES y 


Proof. We use the same notation as above. Let U be the T-ideal generated in 
K{X UY) by the standard polynomial s,(a1,..., #,). Replacing, if necessary, 
k by k+1, we may assume that & = 2p is even. Let J, be the vector subspace 
of A spanned by all products e;, ...e;,, 41 <...< tg. By Exercise 8.4.4 (il) 
it is sufficient to show that, for some n, 


(AL, A)” CU(A) = AlpA+U(A). 
If, for every g = 1,2,...,2p —1, there exists a t such that 
(1, A)’ © Aa A+ UA), 


then 
(AI, A)! = A(, A)’ CABLA+U(A), 
((AI,A)‘)' C (AI, A)! + U(A) = A(Ip A)! + U(A) C AIZA+ U(A),..., 
(AN A)O" C AlpA + U(A) = U(A). 


We shall show that the subalgebra of A generated by /,A modulo the ideal 
Alg4i1A+U (A), is nil of index bounded by p+1. By Nagata-Higman Theorem 
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8.3.2, this implies that ([jA)’ C Alj41A + U(A) for some t = t(qg) and the 
proof will be completed. Let 6,,..., p41 € Ig, v1,...,Up € A. We may assume 





that 5; are monomials of degree g in €1, €2,... For g = 1 we obtain 
Sop (b1, v1, b2, V2,-.., bp, Up) = 
= S- S- (signoT)bo(1)Ur(1)bo(2) Ur(2) . bg (p) Ur(p) + 
TES) TESy 
+(—1)? S- S- (signoT)v7(1) bo(1)Ur(2) 0(2) . -Ur(p) a(p) Sn U(A), 
TES) TESy 
where ... denotes the sum of all products with two consecutive 6;6; and 


...€ Aly A. Multiplying from the right by 6,41, we obtain that 


u= SPS (signer) be(ayer(1)bo(2)Ur(2) «+ -be(p)Ur(p) bp 41 
oESy TESy 


is in AlgA +U(A). Recall that q = 1, ie. the 5,’s are equal to some e;,. We 
use the relations 
bewb, = —b whe, 


which hold for all w € A. (In the definition of A we required these relations 
for w € K{Y) only. Why they hold for any w € A?) We obtain for the above 
expression of u € AlzA + U(A) that 


= Ph YS | (bo(1)Vo(1))(bo(2)%9(2)) «+ (bo(wy%o(p) b+ = 
oESy 


= A(biv, weg bp Up) bp 41 € AlbA+ U(A), 


where 


h(e1,..., 2p) = S- ®o(1)---®o(p) 


oESy 


is the complete linearization of the polynomial «?. Every expression (2; + 
... + 2,)? can be written as a linear combination of elements, obtained by 
substitutions in h(x1,...,2%p). Hence 


(divi tbeve+...+bpvp)PTT ESO (bru tbovet..+bp Up )Pb:A C AL,A+U(A). 


i=l 


Now, if g > 1, we write each b; in the form 6; = e,,b/. We consider the 
standard polynomial 


Sop (€j, 5 (b01), Cj) (b5v2), vey Ons (b,,up)) € U(A) 
and apply considerations similar to those for g = 1 in order to obtain 


(bi 04 + bove +...4+ bp vp PTH E Algpi At U(A), bj € Lg, ¥; EA. 
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Finally, we give a very brief account of some results of Kemer [141] which 
show that the T-ideals of A(X) behave as the ideals of the polynomial alge- 
bras in several variables and show the importance of the matrix and Grass- 
mann algebras for the theory of Pl-algebras. We assume that the field is of 
characteristic 0. 


Definition 8.4.6 The T-ideal S of K(X) is called T-semiprime if any T- 
ideal U with U" C S for some n is included in S, ie. U C S. The T-ideal 
P is T-prime if the inclusion U;U. C P for some T-ideals VU, and Uy implies 
U, C Por, CP. 


Clearly, the above definition is similar to the usual definition of semiprime 
and prime ideals, but instead of ordinary ideals, we consider T-ideals only. 


Exercise 8.4.7 Show that every T-prime T-ideal is T-semiprime. 


Hint. If P is T-prime and U" C P for some T-ideal U, then U? = UU"! 
and U C Por U"-! CP. 


Let & be the Grassmann algebra of an infinite dimensional vector space. 
It has a natural Zo-grading EF = Ey @ Fy, which we have already used in the 
Rosset proof of Amitsur-Levitzki Theorem 7.1.8. Let p,q be positive integers, 
p > q. Consider the vector subspace of M,4,(£) 


b 
Myg={ (0b) [ae Myla), 8 © Mprg(B1)oe © Maxp( Ei), © My(Eo) 


where Mp yq(E1) is the vector space of all p x ¢ matrices with entries from 
1, similarly for the other sets of matrices. For example, 


EZ€1 €2€4 al 
1 EZ€,E2E€4 ELE4ED 
€2 €2 €1€2 
is an element of Mz, with 


E3€1 E€9€4 ey, 
a= € Mo(Eo), b= € Mox1(F1), 
1 €3€1€0€4 €1€4€2 


c= (és €2) € Mix2(F), d= (€1€2) € Mi(£o). 
Exercise 8.4.8 Show that M, _ is an algebra. 


Exercise 8.4.9 Show that the T-ideals T(M,(4)), T(Mg(E)) and T(M, 4) 
are T-prime for all & > land p>q> 1. 
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Hint. Let P = T(R), where R is one of the algebras M,(K), M;(/) and 
My q. Let U; and Uz be T-ideals which are not contained in P. Choose 
fi € U1, fo © Us, fi, fo € P. We may assume that fi = fi(v1,...,%m), 
fo = fol@m+a,..-,@n). (Why?) Fix ri,...,7%m,5mta,---;8n © R with 
filti,...,%m) #90, folsmta,--.,$n) # 0. (If R = M,(£) or R = Mya, 
choose r;,8; in such a way that they depend on disjoint sets of generators 
{e1,...,@a} and {éa41,...,e5} of &.) Find a matrix rmii € R satisfying 
filti,...,?%m)’m4ifo(Smta,---;8n) #0. Hence U1 Ue is not contained in P. 


One of the main theorems in the structure theory of T-ideals developed by 
Kemer gives a list of all T-semiprime and T-prime T-ideals and a description 
of the T-ideals similar to that of the ideals in the polynomial algebra. An 
exposition of his theory can be found in his book [142] or in the original 
paper [141]. 


Theorem 8.4.10 (Kemer [141]) (i) For every T-ideal U of K(X) there exists 
a T-semiprime T-ideal S and a positive integer n such that 


sDuUDS". 


(ii) Every T-semiprime T-ideal S is an intersection of a finite number of 
T-prime T-ideals Q1,...,Qm 


S=QiN...NQm. 


(iii) A T-ideal P is T-prime if and only if P coincides with one of the 
following T-ideals: 


T (My (K)), T( Ms (E)), T(Mp,q), (0), A(X), 


where M),(E) ts the k x k matrix algebra with entries from the Grassmann 
algebra FE. 


9. Finitely Generated PI-Algebras 


In this chapter we apply combinatorial theory of words to the study of finitely 
generated Pl-algebras. After discussions on the Burnside problem in group 
theory and the Kurosch problem in ring theory we present the methods of 
Shirshov based on the notion of height of words. It leads to purely combi- 
natorial proofs of the theorems of Levitzki and Kaplansky which give posi- 
tive answers to the Kurosch problem for Pl-algebras. The approach of Shir- 
shov brings also some quantitative results as the theorem of Berele that the 
Gelfand-Kirillov dimension of a finitely generated PI-algebra is finite, which 
makes the finitely generated Pl-algebras close to finitely generated commuta- 
tive algebras. On the other hand, we give some exotic examples of PI-algebras 
with prescribed Gelfand-Kirillov dimension. The main results hold both for 
unitary and nonunitary algebras and over any field. 


9.1 The Problems of Burnside and Kurosch 


In 1902 Burnside [44] asked his famous problem which has been one of the 
main driving forces of the theory of infinite groups for almost 90 years. There 
are three different versions of the Burnside problem. 


Problem 9.1.1 (Burnside Problem) Let G be a finitely generated group such 
that every element of G is of finite order. 

(i) Is the group G finite? 

(ii) If there exists a positive integer & such that every element g of G 
satisfies the equation g* = 1, is G finite? (Restatement in the language of 
varieties of groups: Is the variety of groups, defined by the identity «* = 1, 
locally finite?) 

(iii) (Restricted Burnside Problem) If G is finite, generated by m elements 
and satisfies the condition g* = 1 for all g € G, does there exist a bound for 
the order of G in terms of m and k? (In the language of varieties: Is the 
variety generated by all finite groups of exponent & locally finite?) 


It is very interesting that the negative solution of Problem 9.1.1 (i) was 
obtained by Golod [117] using ring theoretic constructions. Now there are 
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many counterexamples based on different approaches. Maybe the simplest 
ones are of Grigorchuk [119] who constructed a finitely generated 2-group 
which is not finite and of N.D. Gupta and Sidki [121] based on graph theory. 
Another easy counterexample of Aleshin [7] involving automata theory is 
given in the third and fourth Russian editions of the book by Kargapolov 
and Merzlyakov [135]. 

The negative solution of Problem 9.1.1 (ii) was obtained by Novikov and 
Adyan [195] for & odd and sufficiently large (in the book by Adyan [6] the 
result is given for odd k > 665). Other counterexamples were discovered by 
Olshanskii using his geometric approach to groups presented by generators 
and defining relations. In particular, for every prime p > 10!° Olshanskii 
constructed the so called Tarski-Olshanskii monster of exponent p. This is an 
infinite group in each all proper nontrivial subgroups are of order p. 

For many small & the answer to Burnside Problem 9.1.1 (ii) is still un- 
known. For example, it is still an open problem whether every 2-generated 
group satisfying the identity 2° = 1 is finite. 

Finally, using methods of Lie algebras satisfying the Engel condition (i.e. 
a special polynomial identity) Kostrikin (see his book [154]) proved that for 
p prime the order of the m-generated finite groups of exponent p is bounded. 
Later the problem (also in the affirmative and also using a ring theory ap- 
proach) was solved by Zelmanov [269, 270] for any k. 

The restatement of the Burnside problem for algebras is known as the 
Kurosch problem. 


Problem 9.1.2 (Kurosch Problem [159]) Let R be a finitely generated as- 
sociative algebra such that every element of FR is algebraic. 

(i) Is the algebra FR finite dimensional? 

(i) If R is nil, is it nilpotent? 

(ii) If every element of R is algebraic of bounded degree, is R finite di- 
mensional? 

(ii’) If R is nil of bounded index, is it nilpotent? 


For Lie algebras the “nil” condition is replaced by the Engel condition. 


Problem 9.1.3 (i) Let G be a finitely generated Engel Lie algebra, i.e. for 
every two elements g,h € G there exists an n = n(g,h) > 0 such that 
gad" h = 0. Is G nilpotent? 

(ii) If the Lie algebra G satisfies an Engel condition of bounded degree 
(i.e. the polynomial identity [a#,y,...,y] = 0), is G (locally) nilpotent? 


The negative answer to Problem 9.1.2 (i’) (and hence also to (i)) was given 
by Golod and Shafarevich [118]. They used some quantitative approach to 
construct a series of counterexamples, which serve also as counterexamples 
to Problem 9.1.3 (i). Concerning Problem 9.1.2 (ii’) we have seen that, if the 
characteristic of the field is 0 or sufficiently large, Nagata-Higman Theorem 
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8.3.2 gives that the algebra is nilpotent even without the condition that it is 
finitely generated. 

The nil algebras of bounded index satisfy the polynomial identity «* = 0 
for some k. Similarly, if all elements of the algebra are algebraic of bounded 
degree k, then 1,a,a°,...,a* are linearly depended for any a € R and this 
implies that R satisfies the identity of algebraicity, as the & x k matrix algebra. 
Since we have already seen (for example in the previous Chapter 8) that the 
class of all PlI-algebras has some nice properties, one may expect that the 
Kurosch problem has a positive solution for Pl-algebras. 


Problem 9.1.4 (The Kurosch problem for Pl-algebras) If R is a finitely 
generated Pl-algebra and every element of R is algebraic (or nil), is R finite 
dimensional (or nilpotent)? 


For nil PI-algebras the problem was answered (in the affirmative) by Lev- 
itzki [172] and the general problem for algebraic PI-algebras was solved, also 
positively, by Kaplansky [132]. Both the proofs involve structure theory of 
rings and can be found in the book by Herstein [126]. 

We shall mention also three results on Lie algebras related with Prob- 
lem 9.1.3 (ii). Kostrikin (see [154]) proved that the Lie algebras satisfying 
the Engel condition vad*y = 0 are locally nilpotent, where chark = 0 or 
chark = p > k. Razmyslov [216] constructed an example of an infinitely 
generated Lie algebra satisfying the Engel condition rad’~*y = 0 over a 
field of characteristic p > 3, which is not nilpotent. Finally, Zelmanov [268] 
established that the Engel identity implies the nilpotency of the algebra if 
chark = 0. One may see an exposition of the relations between similar prob- 
lems in groups, associative algebras and Lie algebras in the book by Zelmanov 


[271]. 


9.2 The Shirshov Theorem 


In this section we consider the famous Shirshov theorem [238]. There are 
several expositions of this theorem in the literature. For example, a version 
of the original proof of Shirshov can be found in the book by Zhevlakov, 
Slinko, Shestakov and Shirshov [272]. We give the proof of A.Ya. Belov (see 
the survey article by Amitsur and Small [13]). 

We fix the number m > 1 of the generators of the free associative algebra. 
Let W = (a1, &2,...,€m) be the set of all monomials (words) in K(X,,). It 
is called the free unitary semigroup of rank m and is the free object in the 
class of all semigroups with unity. For w = x;,%;,...%;, € W we denote by 
|w| the length (or the degree) of w. 


Definition 9.2.1 We introduce a partial lexicographic ordering on W assum- 
ing that 7) < %2 <...< @m, and then extending it on W in the following 
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way: 

Li, Xi, > X45, ar oe 
if and only if 7; = j1,...,%% = Je, te+i > Jeri for some k > 0. Two words u 
and v are incomparable if one of them is a beginning of the other (i.e. u = vw 
for some w € W, w # 1). A finite subset of W is called incomparable if it 
contains two incomparable words. 


Definition 9.2.2 The word w € W is called d-decomposable if it can be 
written in the form 
WwW WoW]... WdWd4+1, 


(some of the words wo and wa41 may be empty) and 
WoW] ...WdWd41 > Wo We(1) +++ Wold) Wat 


for every nontrivial permutation o € Sq. 


For example, 
W = Uo 39h U9UZL Lod, = (Hoe) (H3X2e1 Lo) (3H 1X2) (%4%1) 
is a 2-decomposition, with wp = %2x1 and w3 = r4x%1, because 
(9X1) (%3%9%1%2)(#3%182)("4a1) > 


> (901) (%3%1%2)(@3%2%1 Ho) (24%) = WaW2W1 Ws. 


This word has also a 4-decomposition 
w = (%9%1)("3%2%182)(%3%1)(t284)(1), Wo = FQX1, Ws = 1. 


(Prove that this is really a 4-decomposition!) 


Lemma 9.2.3 If w € W has the presentation w = pq = rp for some 
nonempty words p, q andr, then w has the form a” or abab...aba = (ab)"~ta 
for some a,b #1 andn>1. 


Proof. (i) If |p| < |r|, then r = pb and hence w = pbp, i.e. w = aba for a = p. 

(ii) If |r| < |p|, then p = rp,. Hence w = rpig = rrp; and pig = rpy. If 
lr| < |pil, then py = rpg and poq = rpz. We continue this process until we 
obtain py = rpe4i and praig = rpe4i, where |pg4il| < |r|. In this way we 
obtain: 

(a) If pegi =1, then p= r*t! and w =r*t? = a'+? for a=r. 

(b) If pega A 1, then peyig = rprqi and |p_4i| < |r|. From the case (i) 
we obtain for ¢ = pp4i, r= ab, that p= r*+'pp4, and 


W = ProoPey1 = (ab)**? a, 


This completes the proof of the lemma. 
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Lemma 9.2.4 Let 
VU >= WaAWW {WW ...Wd-, WW 


be a word such that the subword w has d different comparable subwords. Then 
the word v is d-decomposable. 


Proof. Let w = a;vjb;,i = 1,...,d, and v, > vg >... > vg. Then v has the 
following d-composition 


v= (wo a1) (v1 6) wy a2) (vebsweaz) sae (vg_-1ba-1 Wa-14a) (Vaba) Wags, 


because 1 > tg > ... > ta, where t; = v;b;wjajzi, 2 = 1,...,d-—1, and 
ta = vaba.- 

Example 9.2.5 Let p and g be comparable words. Then the word w = p*~!q 
contains d comparable subwords. If p > q, then 


p’q>pt7q>...>pq>¢. 


If p < q, then 
qa cL <pg <q. 


Definition 9.2.6 Let w € W and |w| > d. We write 








Ww WI €9W92 see E€qWd, 
where e; is a beginning of w and |e;| = ¢—1. Then |w;| = |w|—%+1. We call 
the words w 1, we,...,wa d-ends of w. 


Lemma 9.2.7 Let |w| > d and let the d-ends of w be incomparable. Then 
w = abc, where |a| + |b] < d, t > 1 and either e = 1 or ¢ ts a beginning of 
b. If |w| > dk, then t > k, and, in particular, w contains a subword b* with 
|b] < d. 


Proof. Let w; and w; be two incomparable d-ends of w, i < 7. Therefore 
w = aw; and w = abw;, hence w; = bw;. Since w; and w; are incomparable, 
this means that w; = w;u and by Lemma 9.2.3 (w; = bw; = w;u) we obtain 
w; = b'c, where either ¢ = 1 or ¢ is a beginning of 6, i.e. w = ab‘c. Clearly, 
ja] + |b} = j —1 <d. If dk < |w], then 





dk < Jal + |b] + @— 1] + lel < d+ (€— Db] + Jel < d+ tb] < (t+ 1d 


and k <t. 


Lemma 9.2.8 Let w € W and |w| = kd. If w does not contain a subword b*, 
|b] < d, then w = vu, the d-ends of v are comparable and v belongs to a finite 
set of elements S such that the number of the elements |S| of S is bounded 


by 
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d d 
[S| < s(d,k) = ) (k —1)m 


Proof. If the d-ends of w are not comparable, then by Lemma 9.2.7, the word 
w contains a subword 6*, |b| < d. Hence the d-ends of w are comparable. 
Let v be the beginning of w of minimal length such that the d-ends of v 
are comparable. By the definition of d-ends (see Definition 9.2.6), |v| > d. 
Let v = gx for some letter x. Then either |g| < d or the d-ends of g are 
incomparable (because q is a beginning of w which is shorter than wv). In the 
second case Lemmas 9.2.3 and 9.2.7 give that q = a(cb)'c, where c,b £ 1, 
ja|+|6|+|c|< dandt > 1 org = ab‘, where 6 F 1, and |a|+|b| < d. The case 
t > k is impossible, because w contains (bc)? and |be| = |b] + |e] < d. Hence 
t < k. We denote by S the set of all words written in one of the following 
two forms: 

(i) v = a(cb)'ex, where c,b #1, 1<¢<k—-1, Ja|+ |b] + |e] < d—1 and 
the letter x is different from the first letter of 0. 

(ii) v = ab'x, whereb41,1<t<k-1, jal+|b|<d—1 and visa letter 
different from the first letter of 6. 

Let us estimate the number of the words of the first kind. (In some places 
we have inequalities because we are not sure that the corresponding presenta- 
tion is unique.) Let / = |a|+ |6|+|c|. Therefore 2 <1 < d—1. The two integers 
|a| and |a| + |e] determine |a], |b] and |c|. Hence we choose two different in- 
tegers |a| and |a|-+ |c| between 0,1,2,...,/— 1, and we have (5) possibilities 
for |a|, |b| and |e]. For fixed J, the possibilities for v are (5) ml (k —1)(m-1) 
(m letters in each of the / positions of a, 6 and c; k—1 fort =1,...,k-1 
and m — 1 letters x different from the first letter of 6). The summation for 
i = 2,3,...,d—1 gives that the number of the words of the first kind is 


bounded by . 
YZ) mite = Hom 0). 


i=2 





Similarly, we obtain that the number of the words of the second kind is 


bounded by 
d-1 
So Im! (k = 1)(m = 1). 
f=1 


Hence 


oe (k — 1)( )+ Sol (k —1)(m—-1) = 


=2 


- y oe (3') sainen(S < (k-1) (5) mt 


t= 
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The following theorem is the version of Belov of the combinatorial theorem 
of Shirshov [238] which is the key result in our approach to finitely generated 
Pl-algebras. 


Theorem 9.2.9 Let w be a word in the free semigroup (41,...,&m), m> 1, 
and let k and d be fired integers with k > d > 1. If the word w ts not d- 
decomposable, then it can be written in the form 


_ ky ko kp 
W = Covy C1 U5* .. Up" Cp, 
where 
d*k?m4 
lui| <d, ki > kyr < dm’, 5 "|ci| < d?ks(d,k) + dk(r +1) < = 


i=0 


the words v; and c; have no common beginning and, if c; = 1, then v; and vi4i 
have no common beginning. Here s(d,k) was defined in the previous Lemma 


9.2.8. 


Proof. If |w| < d?ks(d,k), then we assume that co = w and complete the 
proof. If |w| > d’ks(d,k), then we may write w in the form 


Wo = ULW1UaWe... UpWeUt+i, 


where t = ds(d,k), |w;| = kd and the u,’s are arbitrary. By Lemma 9.2.8, if 
no w; contains a subword 6*, |b| < d, then w; = v;2;, where v; is in the set 
S defined in Lemma 9.2.8 and v; contains d comparable subwords. Since t = 
ds(d,k) > d|S|, some v; appears at least d times in w and by Lemma 9.2.4 the 
word w is d-decomposable. This contradicts with the assumption that w is not 
d-decomposable. Therefore some w; (and hence also w) contains a subword 6*, 
|b| < d. Let co be the shortest beginning of w such that w = court wi, where 
lui] << d, ky > k and the words v, and w; have no common beginning. The 
property that v,; and w , have no common beginning can be always arranged. 
If we assume that for some presentation w = cove wr, lui] <d, ky > k, the 
words v, and w, have the same beginning, e.g. vj = pg and w, = pr, where 
q and r have no more a common beginning, then w = cop(qp)*!r and now gp 
and r have no common beginning. If w; contains a subword 6*, |b| < d, then 
we choose c; such that w, = c, vs? wo and c, is with minimal possible length. 
As above, we may assume that vo and wa have no common beginning. We 
handle wz in the same way as wy), etc. and obtain the following form of w: 


k k 
W = Cov{ C1 Us" .. Cp U8" ep, 


where |v;| < d, kj > k, c; does not contain a subword of the form b*, |b| < d, 
and either v; and c; have no common beginning or, if c¢; = 1, then v; and 


vj41 have no common beginning. Hence w contains r — | disjoint subwords 
d—1 


ve aj,t=1,...,r—1, where 2; is a letter different from the first letter of 
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vj. Here we use that k > d (and that maybe c, = 1). The number of the 
different words of the form v4~!x, |v| < d and x a letter different from the 
first letter of v is 


d-1 
Som! (m— I= mt—m<m!. 
f=1 


If we assume that r > dm“, then some word of the form v4~!x, where |v| < d, 


and «x is not the first letter of v, appears in w at least d times: 


w= qo(v x)qi(v"!e)qo-.-qa-1(v 2) qa. 


By Example 9.2.5, the word v4-' contains d comparable subwords and by 
Lemma 9.2.4 we obtain a d-decomposition of w, which is a contradiction. 
Hence r < dm. In order to prove the inequality for )7;_, |e;|, let us divide 
each of the words c; in several consecutive subwords of length dk and a word 
of length < dk. In this way we obtain 


r le;| 1 r 
° 7=0 cr ° dk i=0 " | " , 


intervals of length dk, where [a] is the integer part of a € R. Since c; does 
not contain subwords of the form 6*, |b] < d, by Lemma 9.2.8 each of these 
intervals has for a beginning an element of the set S with d comparable 
subwords. If p > ds(d,k), then we obtain an element of S$, which appears d 
times as a subword of w. This makes the word w d-decomposable. Since this 
is impossible, we obtain that p < ds(d,k). Using also the inequality 





do leil < dk(wt (r+), 


i=0 
we obtain 


c;| <d2ks(d, k) + dk(r- +1) < @hkmt (@aV@= Dd 1) < 
Yo leel < @ks(d, < ; 
7=0 


< km? (coe i) < stkem! 


Definition 9.2.10 Let R be an algebra generated by r1,...,7%m. Let H bea 
finite set of words of r1,...,7m. One says that Ris of height h with respect 
to the set of words H if h is the minimal integer with the property that, as 
a vector space, R is spanned by all products 


ky uke 


Ui, ++ Uy, 


such that uj,,...,u;, € H andt <A. 
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Exercise 9.2.11 Show that the height of the (commutative) polynomial al- 
gebra K[a1,...,2%m] with respect to the set of words H = {a1,...,%m} is 
equal to m. 

Hint. As a vector space K[x1,...,@m] is spanned by all products an 2. akm 
and the monomials including all m variables (e.g. #1... @»,) cannot be written 
as linear combinations of words with less than m different powers of 2;. 


Exercise 9.2.12 Let R= F,,(U;,(K)) be the relatively free algebra of rank 
m in the variety generated by the algebra U; (A) of the & x k upper triangular 
matrices over an infinite field K. Show that the height of R with respect. to 
the set H = {x1,...,%m} is bounded from above by k(m + 2) — 2. 


Hint. Using the results of Section 5.2, show that R is spanned by all products 


et [ay ay, ]aet.. a(x, 2,,]... [tin @j,]ey eer, 


where a;,);,...,¢; > 0 and p< k—1. Hence A is also spanned by 


bm 


ay a by C1 c 
UT RA Es Rey AP Bs Ue, ey ey pk |, 


involving < mk powers of «; and 2p < 2(k — 1) first powers of #,, and #;,. 


Now we prove the theorem of Shirshov [238] for the height of finitely 
generated Pl-algebras. 


Shirshov Theorem 9.2.13 Let R be a Pl-algebra generated by m elements 
P1,.--;%m and satisfying a polynomial identity of degree d > 1. Then R is of 
finite height with respect to the set of all words r;,...7;, of length s < d. 
Proof. Our considerations will be similar to those in the proof of Theorem 
of Regev 8.1.7 for the exponential growth of the codimension sequence of a 
Pl-algebra. We assume that R satisfies a multilinear polynomial identity of 
the form 

Ly...egs S- AgXo(1)-+-Lo(d)) Ao EK, 

oeSa 

where the summation is on all nontrivial permutations o € Sq. Consider a 
product w= 7;,...77, € FR. First, if the word w is d-decomposable, then we 
can write it as a product of d+ 2 subwords and 


W = WoW, ...WdWd41 > Wo We(1) +++ Wold) Wat 


for any nontrivial permutation o € Sq. Then we apply the polynomial identity 


wo(w1...Wa)Wat1 = S- Og Wo(Wo(1) ++» Wo(a))Wa41- 
oeSa 
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We obtain that w is a linear combination of words which are lower in the 
lexicographic ordering and continue the process until we obtain that all el- 
ements of R are linear combinations of words in 7r1,...,%,, which are not 
d-decomposable. Now we fix the integer k = d. By Theorem 9.2.9, if the word 
is not d-decomposable, then it can be written in the form 


w= court ey us? .. Ur t er, 


where 
: dik?mt — d®m4 


il\<d,kj >k=d,t< dm’, ;|<——= 
Jeil <d, ki 2 < dm d lel < 5 , 





Considering the word ¢; as a product ¢; = uj,...uj, of length q = |e;| with 
respect to the words r1,...,7%m (of length 1 < d), we obtain that Ris spanned 
by all 


_ ky ko ky 
WS UL. Upg Uy Upp tl. Up, U9? Up Up,_ 41». Upy- 
Hence the height of R is bounded by the sum of ¢ and |eo| + ...+ lee] = pe 


and both ¢ and ~, are bounded in terms of d and m, e.g. 


d®m4 


t+pe<dmé+ > 





Exercise 9.2.14 Show that for an m-generated Pl-algebra R satisfying a 
polynomial identity of degree d, the height (with respect to all words of the 
generators of length < d) is bounded by h < O(d?m?). 


Hint. Analyse the proof of Shirshov Theorem 9.2.13. Divide each of the words 
cj,? = 0,1,...,t, in parts of length d— 1 and eventually a shorter residual 
part. Estimate the total number of subwords obtained as a result of this 


dividing. It will be O(d?m*). Add the number t = O(dm) for the words 
kt 


The theorem of Shirshov may be generalized in two directions. One of 
them is to find a better bound for the height of the algebra in terms of the 
number of the generators and the degree of the polynomial identity. The 
other possibility is to find better bound for the length of the words used in 
the theorem. From the first point of view, the best known estimate 


5 4) nd 
he (d? — d*\m 
- 2 
is obtained from the proof of Belov, a simplified version of which we have 
presented. Concerning the length of the words involved in the definition of 
the height of the algebra, it turns out that it is closely related with polynomial 
identities of matrices. 
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Definition 9.2.15 The Pl-algebra R is of Pl-degree k, if k is the largest 
integer such that all multilinear polynomial identities of R follow from the 
multilinear identities of MM), (A). 


Belov [25] showed that, if the algebra satisfies a multilinear polynomial 
identity which does not hold for the (k+1) x (k +1) matrix algebra, then the 
words in the Shirshov theorem can be chosen of length < k. In an implicit 
form the result of Belov is contained also in the works of Ufnarovski [252] 


and Chekanu [48]. 


Theorem 9.2.16 (Belov-Ufnarovski-Chekanu [25, 252, 48]) If R is a finitely 
generated Pl-algebra of PlI-degree < k, then R is of finite height with respect 
to all words of length <k of the generators of R. 


Various generalizations of the Shirshov theorem based on combinatorics of 
words (including infinite words) are given in the survey by Belov, Borisenko 
and Latyshev [27]. In particular, their article contains different relations be- 
tween the bounds for the height with respect to the words of length < d and 
those of length < k (where d is the degree of the polynomial identity and 
k is the Pl-degree). Part of these results are contained also in the master’s 
thesis of Asparouhov [15] (in Bulgarian). There one can also find calculations 
involving the height of concrete relatively free algebras. 

Now we apply Shirshov Theorem 9.2.13 to obtain the results of Levitzki 
and Kaplansky on Kurosch Problem 9.1.4. Pay attention that the statements 
below do not require the nil property or the algebraicity of all elements of 
the algebra as in the original results of Levitzki and Kaplansky. 


Theorem 9.2.17 Let R be a Pl-algebra satisfying a polynomial identity of 
degree d and generated by a finite number of elements r1,...,1m. Let P be 
the set of all products r;,...1:,, k <d. Then 

(i) (Levitzki [172]) If every element of the set P is nil, then the algebra R 
ts nilpotent. 

(ii) (Kaplansky [132]) If every element of P is algebraic, then the algebra 
R is finite dimensional. 


Proof. By Shirshov Theorem 9.2.13, R is spanned by the products 


Rr ye 


w= Uz... U;, 


where ¢ is bounded by the height h and u;, are words of length < d in the 
generators r1,...,?m. Since all possible words u;,; are a finite number, there 
is an upper bound 7 for the class of their nilpotency or for the degree of their 
algebraicity. Hence, if all uj, are nil and the sum kj +...+ ky is sufficiently 
large (e.g. > h(n — 1)), then some u;; appears of degree higher than n— 1 
and the word is equal to 0. If the elements u,;; are algebraic of degree < n, 
then the higher degrees of uj; can be expressed as linear combinations of 
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1, uy ;,- un Hence R is spanned by all words w with &; < n and the 
number of these words is finite. 


9.3 Growth of Algebras and Gelfand-Kirillov 
Dimension 


In this section we shall deal with growth and Gelfand-Kirillov dimension of 
arbitrary finitely generated algebras. For further reading we recommend the 
book by Krause and Lenagan [157] and the survey by Ufnarovski [253]. 


Definition 9.3.1 Let ® be the set of all functions f : NU {0} —> R which 
are eventually monotone increasing and positive valued. This means that 
there exists an no € N such that f(nmo) > 0 and f(n2) > f(ni) > f(no) for 
all ng > ny > no. Define a partial ordering in ® assuming that f ~< g for 
f,g € ® if and only if there exist positive integers a and p such that for all 
sufficiently large n the inequality f(n) < ag(pn) holds and an equivalence 
assuming that f ~ g for f,g € ® if and only if f < g and g < f. We call the 
equivalence class 


G(f)={gE | f~g} 
the growth of f. 


Exercise 9.3.2 If f(n) and g(n) are polynomial functions with positive co- 
efficients of the leading terms, then f ~< g if and only if degf < degg. If 
a,3> 0, then n® ~ n® if and only if a = 7. The functions a” and 2” are 
equivalent if and only if simultaneously either a = @=lora,@>1(a" zg@ 
for 0<a< 1). 


Definition 9.3.3 Let R be a finitely generated (not obligatorily associative) 
algebra and let {r1,...,7m} be a set of generators. Let 


V" = spanin, ...7i, | tj) =1,...,m},n=0,1,2,..., 


where we assume that V° = K if R is unitary and V° = 0 if it is not unitary 
(and for nonassociative algebras the parentheses in the products rj, ...7;,, = 
(r;,.-.)(...1%,,) are distributed in all possible ways). The growth function of 
Ris defined by 


gv(n) = dim(V° + V'4+...4+V"),n=0,1,2,... 


The equivalence class G(R) = G(gv) is called the growth of R (with respect 
to the generating vector space V). 


Exercise 9.3.4 Show that the growth of a finitely generated algebra R does 
not depend on the chosen system of generators. If V = span{ri,...,7%m} 
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and W = span{s,,..., Sm} are the vector spaces spanned by two systems of 
generators of R, then G(gv) = G(gw). 


Hint. Since r1,...,%m generate R, there exists a p such that all s; are in 


YVO+Vl+...4V?. Therefore 
Wot+wle...4w cove svie...4V?", n=0,1,2,..., 


and gw(n) < gv (pn) for every n € N. Similarly we obtain that there exists a 
q € N such that gy(n) < gw(qn), n EN. Hence G(gv) = G(gw). 


Definition 9.3.5 Let R be a finitely generated algebra, with a set of gen- 
erators {r1,...,?m}, and let gy(n) be the growth function of R, where 
V =span{ri,...,?%m}. The Gelfand-Kirillov dimension of R is defined by 


GKdim(R) = limsup(log, gv (n)) = lim sup log gv (n) . 


noo noo logn 


Exercise 9.3.6 Show that the Gelfand-Kirillov dimension of a finitely gen- 
erated algebra does not depend on the choice of the set of generators. 


Hint. Let V and W be finite dimensional subspaces spanned respectively 
by two systems of generators of the algebra R and let GKdimy(R) and 
GKdimy (R) be the Gelfand-Kirillov dimensions of R defined by means of V 
and W. By Exercise 9.3.4, there exists a p € N such that gw(n) < gy(pn). 

Hence 
logon JV(PR) — lo8mn gv (pn 
log, guv(n) < log, gv(pn) = eee fe(BM) _ Te pn ov) 
OBpn 2 1 —log,,, P 

and 


lo n 
lim suplog,, gw(n) < lim sup lo8pn gv (pn) = 
noo pn—oo 1- lognn Pp 


= lim suplog,,,, gv (pr) < limsuplog,, gv (7). 
TL CO 


pn+oo 


Hence GKdimy (R) < GKdimy (R). Similarly GKdimy (R) < GKdimy (R). 


Exercise 9.3.7 Show that, in the above notation, 


GKdim(R) = inf(o ER|G(R) x G(n*)). 


Hint. If GKdim(R) < a, then for sufficiently large n 


log, gv(m) <a = log, n*, gv(n) <n*, G(gy(n)) X G(n*). 
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Similarly, if GKdim(R) > a, then there exists an ¢ > 0 and a sequence 
ny < ne <...such that 


net® <gy(ng), k=1,2,... 


This shows that the inequality G(gy (n)) < G(n®) is impossible because the 


function n?** grows faster than the function n@. 


Exercise 9.3.8 Show that GKdim(A[1,...,#m]) =m. 


Hint. The number of all monomials of degree < n in m variables is equal 
to the number of all monomials of degree n in m + 1 variables since for 
a) +...+4@m <n there exists a 1-1 correspondence 


ay 


a ado a4 a _ 
apt... on™ > epfeyt ek, ag = a — (a, +... + am) 


between these sets. Hence with respect to the usual set of generators of R = 


K[e1,.--,¢m], 
g(n) = (" *™) 


which is a polynomial of degree m. Hence G(R) = G(n™). 


Exercise 9.3.9 Show that the free associative algebra K(a1,...,%m),m > 1, 
has no finite Gelfand-Kirillov dimension. 


Hint. For the usual set of generators of K(x1,...,%m), the growth function 


is equal to g(n) = 1+m+m?+...4+m” and this function grows faster than 


any polynomial function. 





Exercise 9.3.10 Let R be a finitely generated associative or Lie algebra. 
Show that GKdim(R) = 0 if and only if R is a finite dimensional vector 
space. Otherwise GKdim(R) > 1. 


Hint. Let R be generated by a finite dimensional vector space V. Use that for 
associative and Lie algebras V”t! = V"-V and V"+! = [V™, V], respectively. 
If vrett CVO4ViI+...4V"9 for some no, then V"F1 C VO4VI+...4V 
for all n > no and Fis a finite dimensional vector space. Clearly, in this case 
GKdim(R) = 0. Otherwise V0 + V14...4-V"° A V94VE4...4V"F for 
alln = 1,2,... and gy(n) > n. Hence GKdim(R) > 1. 





Exercise 9.3.11 Show that if R is a finitely generated algebra and S is a 
homomorphic image of R, then GKdim(R) > GKdim(S). 


Hint. Using a set of generators of R and their images as a set of generators 


of S, we see that G(S) < G(R). 
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Exercise 9.3.12 If R is a finitely generated algebra and S is a finitely gen- 
erated subalgebra of R, then GKdim(R) > GKdim(S). 


Hint. Let W be the vector space spanned by a set of generators of S. Extend 
it to a vector space V by a set of generators of R. Then W” C V” and 
G(S) x G(R). 


Exercise 9.3.13 Let R be a finitely generated commutative associative alge- 
bra and let S be a finitely generated subalgebra of A such that R is a finitely 
generated S-module. Show that GKdim(R) = GKdim(S). 


Hint. Let R be generated as an S-module by r1,...,7m and let 


m 


k k 
rrp So stra, 5) ES. 
k=1 


If S is generated by s1,...,5p), consider W = span{s(*) ,si} C S and the 
generating space V = W + span{ry, +5 %m} CR. Show that 


m n—-l 


io ee 


k=1 p=0 


Hence gy(n) < (m+ l)gw(n) and G(R) = G(S). On the other hand, Exer- 
cise 9.3.12 gives that G(S) < G(R). Hence G(R) ~ G(S) and GKdim(R) = 
GKdim(S). 


Exercise 9.3.14 Let R be a finitely generated commutative associative al- 
gebra. Using Noether Normalization Theorem (that R contains a polynomial 
subalgebra S such that RF is a finitely generated S-module, see e.g. [161]), 
show that the Gelfand-Kirillov dimension of R is equal to the transcendence 
degree of R (i.e. to the maximal number of algebraically independent ele- 
ments). 


Hint. Use that the Gelfand-Kirillov dimension of S' is equal to the number 
of the free generators of S and apply the previous Exercise 9.3.13. 


Exercise 9.3.15 Let G be a finitely generated Lie algebra and let U = U(G) 
be its universal enveloping algebra. Show that GKdim(U) = dimgG if G is a 
finite dimensional vector space and U has no finite Gelfand-Kirillov dimension 
if G' is infinite dimensional. 


Hint. Let dimG = m and let G has a basis g),..., 9m. Then U is generated 
by the vector space G and Poincaré-Birkhoff-Witt Theorem 1.3.2 gives that 


GO4+G+...4G" =span{g?...g4™ lay t...+am <n}. 
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Hence the growth G(U) of U is equal to the growth of the polynomial algebra 


Klgi,---,9m] in m commuting variables g1,..., ¢m. Let dimG = co and let G 
be generated by some linearly independent elements g1,...,9,. Fix a positive 
integer m > k and choose elements gx41,.--,;9m In G such that 91,...,9m 


are linearly independent. Let V,,, = span{gi,...,9m}. Then U is generated 
by Vin. By Poincaré-Birkhoff-Witt Theorem the monomials gf? ...g¢™ with 
a4 +...+4@m <n are linearly independent. Therefore, 


gv(n) = dim (>: vs) > (" *™) 


which is a polynomial of degree m. Hence GKdim(U) > m for every m > k. 


9.4 Gelfand-Kirillov Dimension of PI-Algebras 


Now we shall prove the theorem of Berele [29] that the finitely generated PI- 
algebras have finite Gelfand-Kirillov dimension. The original proof of Berele 
is based on another (weaker) version of Shirshov Theorem 9.2.13. A proof 
based on structure theory of rings can be found in the book by Krause and 
Lenagan [157]. 


Theorem 9.4.1 (Berele [29]) Every finitely generated Pl-algebra has finite 
Gelfand-Kirillou dimension. 


Proof. Let R be generated by V = span{ri,...,7m} and let it satisfy a 
polynomial identity of degree d. By Shirshov Theorem 9.2.13, there exists an 
integer A, the height of R, such that R is spanned by the words 

ult uk t<h, 


ay it? 


where w;,,..., ui, are all words of length < d. In the proof of the Shirshov 
theorem we have used only that FR satisfies a multilinear polynomial identity. 
Hence V” is spanned on those words with 


kiluz,| +... + kelui,| = nr. 


Therefore, V0+V!+...+V” is a subspace of the vector space spanned by 
all words 
ut yee ky te... thy <n. 


a tn? 
Let p be the number of words of length < d (p = 14+m+...+ m4), 
The number of sequences of indices (71,...,%,) is bounded by p’. Hence the 
dimension gy(n) of V0+V1t+...+V™ is bounded by the product of the 
number of sequences (#],...,é,) and the number of monomials of degree < n 
in A variables, 
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win) so" 7") 


which is a polynomial of degree h. Hence 
GKdim(R) < h, 
the height of R. 


We state as a corollary the last inequality. 


Corollary 9.4.2 The Gelfand-Kirillov dimension of a finitely generated PI- 
algebra R is bounded by the height of R with respect to any finite set of 
monomials of any finite set of generators of R. 


Exercise 9.4.3 If Ris an m-generated Pl-algebra with a polynomial identity 
of degree d, show that GKdim(R) < dm’. 


Hint. Use Theorem 9.2.9 and modify the proof of Theorem 9.4.1. Show that 
R=span{covt'e,...vkrey | S- lei] <a, k1,..., ky > 0, r < dm} 
i=0 


for a being fixed implies that GKdim(R) < dm’. 


Exercise 9.4.4 Let chark = 0 and let R = Fi,(E) be the relatively 
free algebra in the variety generated by the Grassmann algebra. Show that 
GKdim(R) = m. 


Hint. Use Theorem 4.3.11 (i) and the proof of Theorem 5.1.2. As a graded 
vector space F is a direct sum of a finite number of copies of 


(the polynomial algebra) x (homogeneous element) 
because RF has a basis 
ay hm . . . . 
apt... en” [ei,, €io]---[@in_1, in], L< it < ta <...< dop_a < tog <m. 
Starting with the set of generators {21,...,%m}, we obtain 


G(R) =G(K[a,...,¢m]). 


Exercise 9.4.5 Let the field K be infinite and let R = F,(U,(K)) be the 
relatively free algebra of the variety generated by the algebra of k x k upper 
triangular matrices. Show that GKdim(R) = mk. 


Hint. Use the idea in the previous Exercise 9.4.4. As in Exercise 9.2.12, FR is 
spanned by 
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ay... ea [aa,, a;,]e? - am [i.,0j.]-.-[@i,, ej, Jay... ane, p< k, 


and the growth of R is bounded from above by the growth of a finite direct 
sum of polynomial algebras in me variables. On the other hand, show that 
the following elements are linearly independent in R which will give that the 
growth of R is bounded from below by the growth of the same polynomial 
algebra in me variables: 


ee ete lay, woe’... 0b lay, ee]... [ar eejatl.. eS, 


where the number of the commutators is ¢ — 1. 


We give a short survey on some results concerning growth and Gelfand- 
Kirillov dimension of relatively free algebras. For more detailed exposition 
see [87]. Although some of the results below hold for any field or for any infi- 
nite field, we assume that chark = 0. The Gelfand-Kirillov dimension of the 
algebra generated by m generic k x k matrices is equal to the transcendence 
degree of its centre and is (m—1)k? +1 (see the book by Procesi [213]). The 
asymptotic behaviour of the growth of some relatively free algebras was stud- 
ied in the paper of Grishin [120]. In his survey article [106] Formanek gave 
a formula for the Hilbert series of the product of two T-ideals as a function 
of the Hilbert series of the factors (see Halpin [125] for the proof). A trans- 
lation of the formula of Formanek in the language of relatively free algebras 
and codimensions can be found for example in the paper by Drensky [76]. 
Using the result of Formanek, Berele [30, 31] calculated the Gelfand-Kirillov 
dimension of some relatively free algebras with T-ideals which are products 
of T-prime T-ideals (in the classification of Kemer, see Theorem 8.4.10). 
Markov [183] announced that for relatively free algebras F(R) satisfying a 
nonmatrix polynomial identity, the Gelfand-Kirillov dimension is the same 
as the Gelfand-Kirillov dimension of the algebra Fy,(U,(K)), where U,(K) 
is the largest algebra of upper triangular matrices satisfying all polynomial 
identities of R. Theorem 6.2.5 of Belov [26] for the rationality of the Hilbert 
series of relatively free algebras, combined with Theorem 9.4.1 of Berele, im- 
plies that GKdim(/,,(2)) is an integer for any PI-algebra R and any m. This 
result follows also from the theorem of Kemer (see [142]) that the relatively 
free algebra F(R) is representable (i.e. isomorphic to a subalgebra of the 
K-algebra M;,(S) for some k and some commutative algebra S) and the the- 
orem of Markov (announced in [183]) that the Gelfand-Kirillov dimension of 
a finitely generated representable algebra is an integer. 

Since Shirshov Theorem 9.2.13 gives a bound for the Gelfand-Kirillov di- 
mension, it is interesting to see how far is this bound from the real value of 
the Gelfand-Kirillov dimension. Since the Shirshov theorem uses only the ex- 
istence of a polynomial identity, the experiment has to be made correctly, 1.e. 
for relatively free algebras. For example, comparing Exercises 9.2.12, 9.2.14, 
9.4.3 and 9.4.5 (and their hints) we see that for Fy,(U;,(K)) the Gelfand- 


Kirillov dimension can be obtained from the Shirshov theorem. It turns out 
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that one can define the so called essential height which is a modification of the 
original notion of the height introduced by Shirshov (see the survey article 
of Belov, Borisenko and Latyshev [27] and the master’s thesis of Asparouhov 
[15]). In particular, the Gelfand-Kirillov dimension of a finitely generated pre- 
sentable algebra coincides with its essential height [27]. Calculations with the 
height and the Gelfand-Kirillov dimension of concrete relatively free algebras 
are given in [15]. 

The Gelfand-Kirillov dimension of an arbitrary finitely generated com- 
mutative algebra is an integer. We shall give a modification of the example 
of Borho and Kraft [36], showing that the Gelfand-Kirillov dimension of a 
finitely generated Pl-algebra can be equal to any real a > 2. There are 
also some other examples in the book by Krause and Lenagan [157] and the 
survey by Ufnarovski [253]. A result of Bergman (see [157]) shows that there 
exist no algebras with Gelfand-Kirillov dimension in the interval (1, 2). Hence 
the Gelfand-Kirillov dimension of a finitely generated associative algebra can 
have as a value 0, 1 and any a > 2. We also give an example of a finitely 
generated Lie algebra with polynomial identity and with Gelfand-Kirillov di- 
mension any a in the interval (1,2). This is a special case of the examples 
given by Petrogradsky [206] and showing that there exist Lie algebras with 
any prescribed Gelfand-Kirillov dimension a > 1. 


Lemma 9.4.6 Let f(u) and g(u) be two continuous monotone increasing 
functions defined for every u > 0. Let 


t 
a(t) = ant + axt? + ..., (0) = = bt 4 at +... 


be formal power series such that 
f(n) <an <g(n),n=1,2,... 
Then 


nm 


n+l 
/ f(u)du <b, =a, +a9+...+4y <|/ g(ujdu. 
0 1 


Proof. For every k = 1,2,...,n, 
Tk -1) < f(u) < f(k) < an, k-1 Su sk 4h. 


Hence 
k n n k 
flujdu<an, | f(ujdu= 5° f(u)du <a, t+ao+...+4n = bn. 
k-1 0 poy UR-1 
The other inequality can be obtained analogously. 


Till the end of the section we fix an integer & > 2 and the algebra R 
generated by two elements a and y with defining relations 
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yx? yeP?y...yx?*®y=0, pj > 0,2=1,...,k. 


Hence all words containing & +1 times the variable y are equal to 0 in R. We 
assume that the base field K is arbitrary and the algebra R is graded in the 
canonical way. 


Lemma 9.4.7 The algebra R has a basis 
{at yah? y.. xP yeP et! |p; > 0,5 < kf. 
The Gelfand-Kirillov dimension of R is equal to k +1. 


Proof. The free algebra K(x, y) has a basis 
{xP yaP? yo. aP yePst+! |p, > 0,5 =0,1,2...}, 


and the vector space J spanned by the products with s > k is an ideal of 
K(«,y). Hence R = K(x, y)/I and the first part of the lemma is established. 
Therefore the Hilbert series of R is equal (why?) to 


1 1 tf 
Hilb(R, t) = dimR™ t” = —— +t——— +... 4 ———_.. 
ilb(R,#) = ) dime Tt!) ‘G-m T Gop 
n>o 
The coefficient h, = dimR™) is a polynomial of degree k in n. Hence ho + 
hy +...+ hp is a polynomial of degree & + 1 (why?) and this shows that 


GKdim(R) =k +1. 
Exercise 9.4.8 (i) Show that the algebra R satisfies the polynomial identity 


[ay, r9| see [@ordi, Lon+2] = 0. 


(ii) Show that R has the following matrix presentation 


@ = e111 + Eoeo0 +... + Exp eegijegi, ¥ = 12 teaa t+... + ek e441, 


where £1,...,€41 are independent commuting variables. 


Hint. (i) Use that every nonzero commutator of monomials in R is a differ- 
ence of two monomials, each containing the variable y. 

(ii) Show that the algebra generated by these two matrices satisfies the 
relations of R and the basis elements of A are linearly independent in the 
considered matrix algebra. (Repeat the arguments in the proof of Theorem 
5.2.1 for the basis of the relatively free algebra in the variety generated by 
the algebra of upper triangular matrices.) 


For any real number 0 < 2 < 1 we build a sequence ao, a1,... in the 
following way: ap = 0, a, = 1, and, inductively, 
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Gy, —0, if ay tagt+... @n—1 = [n"], 











dy, = 1, if ay taegt... an—1 < [n"], 





where [n“] is the integer part of n’. In this way, for every n = 1,2,..., 


n? —1 <a, tagt+...4+a, = [n*] <n. 


Exercise 9.4.9 Show that for any @ € (0,1) it is possible to construct the 
above sequence do, G1, @9,... 


Hint. Use calculus. By the mean value theorem, for every u > 1 


B _ 4,6 
aT = 9" <1, 


for some € € (u,u+ 1). Hence the difference between n? and (n +1)° is less 
than 1 and this means that [(n + 1)?] — [n°] < 1. Choosing an41; = 0 or 
@n41 = 1 we always may make the sum a, +...+ an41 equal to [(n + 1)*]. 


Now we consider the vector subspace Jg of R spanned by all products 
Pt yeP2y 2 oP yg Pett 


where p1,p3,P4,.--,Pe4+1 are arbitrary nonnegative integers and py runs on 
the set of all n with a, = 0. Clearly, Jg is an ideal of R because yJg = Jey = 0 
and the multiplication by x preserves the property for po. 

For example, for 2 = 0.5, by definition a9 = 0, a, = 1, and 


[V2] = [V3] = 1, a2 = a3 = 0, 
[V4] = 2 > [V3], ag =1,... 


For k = 2, the ideal Jo.5 contains all 








Pt yar yx? Pt yar? ya? aPt ya? yx? 
because dp = ag = a3 = 0, and does not contain 
Pt yet yas xP yxt ya? 
because a; = a4 = 1. 


Lemma 9.4.10 The Hilbert series of the factor algebra R/Jg is equal to 


k-1 
; t4 a(t)t* 
Hilb(R/Jg,t) = ) ——— + —-* 
1 ( /Je,t) Gp * Oa’ 
q=0 
where a(t) = t+ aot? + ast? +... is the generating function of the sequence 


do, G1, @9,... defined above. 
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Proof. The proof follows easily by counting the basis elements of R/Jg. The 
nominators t? indicate the multiplicity g of the appearance of y in the mono- 
mials of the basis of R/Jg. The denominators (1 — t)?+! stay because for 
monomials containing q times y, the x’s behave as the commuting variables 
%1,...,€q41 (2? between i-th and i+ 1-st y’s is the same as ti41)- Finally, in 
the monomials with k y’s, the power x? appears between the first and second 
y’s if and only if a, = 1. 


Theorem 9.4.11 For any real a > 2 there exists an algebra with two gener- 
ators and with Gelfand-Kirillov dimension equal to a. 


Proof. If a is an integer, then it is easy to find an algebra with Gelfand- 
Kirillov dimension equal to a. For a = 2 we can consider the polynomial 
algebra K[x, y] in two commuting variables. For a > 3 the algebra R defined 
above for k = a — 1 has Gelfand-Kirillov dimension & + 1 = a. Now let 
k<a<k+1. We choose 6 = a—k. We shall show that GKdim(R/Jg) = a 
for the algebra R/J, defined in Lemma 9.4.10. By the same lemma, the vector 
space R/Jg is graded and its Hilbert series is equal to 


k-1 
h tg a(t)t* 
n>o 
where a(t) = ait + agt? + ast? + ... satisfies the condition 
P_ 1 cay t...4an = [n*] <n’. 
Hence, by Lemma 9.4.6, the coefficients of the series 
Leer = ap 
x (1-1) 
satisfy the inequalities 


1 


aatan= | (wo —1du< M4 M4. 4D < 
a 


n+l 1 
< | udu = 
~ Sy G+1 


and, by induction on p, 


(n+ HPF! — 1), 





—fi(n) <4) 4. +l) < a(n t k)Ft? + fol(n), 


for some positive y € R and some polynomials f1 (x), fo(«) € R[x] of degree 
<k. Since for n sufficiently large, the coefficients of the part of the Hilbert 
series of R/Jg 
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a 
_ +1 
(1-08 


are also polynomials of degree k — 1, we obtain that 
ankte _ Ay(n) < bo +O) +... 4b, < anh te 4 ho(n) 
for 0<y ER, hi(x), ho(x) € Rl] and degh; < k, i= 1,2. This implies that 
G(R/Ja) = G(nk*®) = Gn"), 
where G(n*) is the equivalence class of n°. Hence GKdim(R/Js) = a. 
Exercise 9.4.12 (Petrogradsky [206]) For any a € (1, 2), there exists a Lie 


algebra Gq with two generators and with Gelfand-Kirillov dimension equal 
to a. 


Hint. Consider the associative algebra R generated by # and y and with 
defining relations 
yyx*y = 0, eyz*y=0,a> 0. 


Let G = R‘) be the vector space R, considered as a Lie algebra with multi- 
plication [u,v] = uv — vu (as in Exercise 1.1.10 (ii)). Show that G' has a basis 
consisting of x, y, 


yad*x = [y,2,2,..., a], [yad* x, y] =l[y,2,a,...,2,y],4=1,2,..., 
Upq = [yad? x, yad? x] = [[y,x,2,..., a], [y,2,...,2]],p >> 1. 
Show that the elements up, span the ideal G” of G. Calculate the Hilbert 
series of G 
Hilb(G,t) =2¢+P +P 404+ ..)-P + So ant”, 
n>5 





where a, 1s the dimension of the homogeneous component of degree n of 
G". The coefficients a, grow linearly and, hence, faster than n*~!. Define 
inductively in the following way an ideal J of G, such that J C G”. The 
first elements of J are for the smallest n with ag, +...+ a, > n®. Choose 
k, linearly independent elements v1,..., vz, of degree n in G” in such a way 
that 

a5 +... +4, — ky = [n°]. 


Let J, be the ideal generated by W,, = span{v1,..., vg, }. Consider the 
subspace Vn4i = [Wp, a]. Its dimension is < k,. Hence 


a5 +...+4n41 —dimW,, — dimV,41 > [(n + 1)*]. 


Choose a homogeneous vector space W,,41 of degree n+ 1 such that Vigqa C 
Wri CG” and 
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a5+...+4n41 —dimW,, — dimW,,41 = [(n + 1)°]. 


Generate by W, + Wr41 an ideal Jp4i of G. Continuing in this way, we 
obtain a sequence of ideals J, C Jn41 C ... such that the coefficients of the 
Hilbert series )7,,5.5 nt” of G"/Jp satisfy 


C5 +...4+¢p = [p®]. 


Let J = JntJnyit+... Derive from here that the Gelfand-Kirillov dimension 
of the Lie algebra G/J is equal to a. 


By Exercise 9.4.8, the example in Theorem 9.4.11 is a homomorphic im- 
age of the relatively free algebra F2(U,41(4)) where Ug4i(K) is the as- 
sociative algebra of (k + 1) x (k + 1) upper triangular matrices. Similarly, 
the example in Exercise 9.4.12 is an image of the relatively free Lie algebra 
F(US(K)). Drensky and Kassabov (see the master’s thesis of Kassabov 
[136]) showed that for k > 3, m > 2 and for any real a in the interval 
[2, GRdim(Fin(U,(K)))] there exists a homomorphic image R of Fin (Us, (K)) 
such that GKdim(R) = a. Similarly, if 1 < a@ < GKdim(F,(U{(K)))], 
k > 3, m > 2, there exists an image G of the relatively free Lie algebra 
Fn (UL (K)) with GKdim(G) = a. 


10. Automorphisms of Free Algebras 


In this chapter we study the automorphism group of the polynomial algebra 
in m variables and of the free associative algebra of rank m, where m is a 
fixed positive integer. In particular, we prove the theorem of Jung [130] and 
Van der Kulk [257] stating that the automorphisms of A [x,y] are tame and 
its noncommutative analogue, the theorem of Makar-Limanov [177] and Czer- 
niakiewicz [53] for the tameness of the automorphisms of A (a, y). The proof 
of the Jung-Van der Kulk theorem which we give here uses locally nilpotent 
derivations. We consider also automorphisms of the polynomial algebra in 
m variables which are exponents of locally nilpotent derivations and show 
that important automorphisms can be obtained in this way. Finally we ex- 
tend our considerations to automorphisms of relatively free algebras F(R), 
where R is some PI-algebra. Most of the main results hold (maybe with dif- 
ferent proofs) for algebras over any field K and, after some restatement, for 
nonunitary algebras. Nevertheless we assume that all algebras are unitary 
and over an algebraically closed field AK of characteristic 0. 


10.1 Automorphisms of Groups and Algebras 


There are two main reasons to study the automorphisms of A [a1,...,¢ml 
and K(a1,...,%m). The first is that the polynomial algebra and the free 
associative algebra are among the important algebras and the second reason 
is the following. 
Let R be any associative algebra finitely generated by a set {r1,..., rm}. 
The map 
L; —7rj,t=1,...,m, 


can be uniquely extended to a homomorphism 
nm: K(t1,...,;%m) OR 


and 
R= K(m,...,€m)/Kerz. 


A possible approach to describe the automorphism group Aut R of the algebra 
FR is to do this in three steps. 
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Step 1. Describe the automorphisms of K(x1,...,%m).- 
Step 2. Find the automorphisms ~ of K(a1,...,%m) which induce automor- 
phisms of R, i.e. such that (Ker) = Kerz. 
Step 3. Try to understand the automorphisms of R which cannot be obtained 
by the first two steps, i.e. cannot be lifted to automorphisms of the free alge- 
bra. 

Instead of the homomorphism a : K(a1,...,®@m) —> R, we can consider 
the homomorphism 


15: Fm(S) = K(e1,...,@m)/(K(a1,...,8%m)NT(S)) — R, 


where S is a Pl-algebra and R satisfies all polynomial identities of S$. It turns 
out that for many algebras S, the relatively free algebra F,(S) has “more” 
automorphisms than the free algebra K(#1,...,®m) and hence induces more 
automorphisms for R. This also allows to use powerful techniques typical 
for the theory of Pl-algebras. Since the class of Pl-algebras is close to the 
class of commutative algebras, some noncommutative results have purely 
commutative consequences. Especially interesting is the algebra of two generic 
2 x 2 matrices, in particular because every of its automorphisms induces an 
automorphism of the polynomial algebra in five variables (see Remark 10.5.3 
below). 

A similar approach is possible also for other classes of algebraic systems, 
e.g. for commutative algebras, Lie algebras, groups, etc. Of course, then we 
should replace the free associative algebra K(a1,...,%m) respectively with 
the polynomial algebra K[x1,..., 2m], the free Lie algebra of rank m, the free 
group of rank m, etc. For further reading we recommend the books by Cohn 
[51] and Kharchenko [147] as well as the collection of papers [102] edited by 
van der Essen and the two survey articles by Drensky [81, 86]. 

Since we consider relatively free algebras only, every map {@1,...,&m}—7 
Fy,(R) can be uniquely extended to an endomorphism of the algebra. Some- 
times we shall use the following notation which also appears in the literature 
and especially in commutative algebra. If ¢ € EndF,,(R), and ¢(a;) = fi, 
t= 1,...,m, we denote ¢ also as 


g= (f1,---, fm). 


We always assume that the endomorphisms act from the left, 1.e. 


(o¥)(u) = (G0 Y)(u) = O(Y(u)), oY € Endlin(R), u € Fm(R). 


In order to simplify the notation we fix an m-dimensional vector space Vj, 
with basis {%1,...,@%m} over the field AK and denote by A[V,,] and K(Vin) 
respectively the polynomial algebra and the free associative algebra freely 
generated by x1,...,2%m. For small m we shall also use other letters, e.g. 
«,y,...,2%, for the free generators of K[Vin], (Vn) and Fi, (R). 

One of the main problems under discussion is the following. 
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Problem 10.1.1 Find a natural and “nice looking” set of generators of the 
automorphism group of K[Vin], K(Vm) and Fin(R), where R is some PI- 
algebra. 


One of the first results in this spirit is for free groups and is due to Nielsen 


[193]. 


Theorem 10.1.2 (Nielsen [193]) The automorphism group of the free group 
Gm of rank m > 2 is generated by the following automorphisms o, 61, 62 
defined by 
(i) o(@i) = %o,(%), where oo belongs to the symmetric group Sm of degree 
m; 
(ii) 01 (a1) = ay, 0, (%;) = «j,i #1; 
(iii) 5 (a1) = U1 109, 5 (a3) = Xj, if 1. 


Traditionally the automorphisms of a finitely generated relatively free 
group G induced by automorphisms of the free group are called tame and the 
other automorphisms are wild. Below we translate the notion of tameness in 
the language of ring theory. 


Exercise 10.1.3 Show that the automorphism group AutA [2] of the poly- 
nomial algebra in one variable is isomorphic to the affine group of the line, 
and a map ¢: « > K[a] induces an automorphism of A[«] if and only if 
é(4) =ax+6,a,8 € K, a #0. Find the inverse of this automorphism. 


Hint. Show that ¢(#) = ax+3 has an inverse of the form 7 1(x) = a7!a+74, 
y € K.If ¢ is any automorphism of A [x], show that for h(x) € K[z], 


deg¢(h(a)) = degd(x)degh(a). 


If degg(x) > 1, then a is not an image of any h(a) € A [a]. 


Since the description of the automorphisms of K[x] = K(x) is trivial, in 
this chapter we assume that the integer m is bigger than 1. 


Exercise 10.1.4 (i) Let g = (aq) be an invertible mx m matrix with entries 
from K and let $1,...,8m € K. Show that the map 


m 
Li > Oe + ) Apitp, t= 1,...,m, 
p=l 


defines an automorphism of [Va]. 
(ii) Show that the following endomorphism of K[V,,] is an automorphism 


@ = (ae, 4+ fi,...,mtm + fm), 
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where 0 4 a; € K, and the polynomials f; = f;(#i41,-...,%m), t= 1,...,m, 
do not depend on #,,..., #; (in particular, fi, is a constant). How to find the 
inverse of 6? Find the inverse of the automorphism of K[2, y, z], where 


b= (Qe +y? +2,-yt+ 27,32). 


(iii) Show that the statements of (i) and (ii) hold also for the free asso- 
ciative algebra and any relatively free algebra. 


Hint. (ii) Apply induction on m. For m = 1 apply Exercise 10.1.3. For m > 1, 
use that ¢ is invertible on A [a,...,2@m] and show that 


1 (a1) = azpta, —a7!fi(e7 1 (ae),..., O71 (@m))- 


Definition 10.1.5 (i) An automorphism ¢ of the polynomial algebra A [Vin] 
is called affine if 


o(xi) = Bi + So apie, i=1,...,m, 
p=l 


where g = (Q@pq) is an invertible m x m matrix with entries from K and 

B1,.--,8m € K. If all 2; are equal to 0, then ¢ 1s called linear and in this way 

we identify the general linear group GL,,(K) with a subgroup of AutK [Vy]. 
(ii) An automorphism ¢ of K[V»] is triangular (or de Jonquiéres), if 


O(a) = age; + filtigi,..-,%m),0O4 a; EC K,t=1,...,m, 


where the polynomial f;(#:41,...,%m) does not depend on a4,..., %;. 

(iii) An automorphism of A[V,,] is called tame if it belongs to the sub- 
group of AutK[V,,] generated by the affine and the triangular automor- 
phisms. 

The other automorphisms of K[V,,] (if they exist) are called wild. 

The notion of tame and wild automorphisms is defined in a similar way for 
K (Vp) and is also extended to the automorphisms of relatively free algebras 


Fy,(R). 
The classical form of Problem 10.1.1 is the following. 
Problem 10.1.6 Are all automorphisms of K[Vin], K(Vin) and Fy, (R) tame? 


It turns out that this is a very difficult problem. For polynomial algebras 
and free associative algebras the affirmative answer is known for m = 2 
only. For m > 2 the problem is still open and there are some evidences that 
probably the answer is negative. 


Exercise 10.1.7 Let ¢ = (fi,..., fm) be an endomorphism of A[V,,] such 
that 
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fi = filti,...,@m) = Bi + gi(ai,...,@m), 6 € K,i=1,...,m, 


and the polynomials g; have no constant terms. Show that ¢ is an automor- 
phism if and only if # = (g1,..., 9m) is an automorphism. 


Hint. Consider the translation r = (#1 + 61,...,%m + Bm). Show that ¢ = 
wor. Since r € AutK[V,,], conclude that ¢ is an automorphism if and only 
if # is an automorphism. 


Definition 10.1.8 We say that the endomorphism # = (91,...,9m) of 
K[Vm] (or of K(V,), or of Fy,(R)) preserves the augmentation (or is an en- 
domorphism without constant terms) if the polynomials g; = g;(#1,...,2%m) 
have no constant terms. (These endomorphisms have the property that they 
preserve the augmentation ideal of K[Vjn], i.e. if h is a polynomial without 
constant term, then the same is #(h).) 


The next exercise shows that the existence of unity is not very impor- 
tant in our considerations and we can define tame automorphisms also for 
nonunitary algebras. 


Exercise 10.1.9 Let 


= (fi +91,---,;8m + 9m), = (915---59m), 8 EK, t=1,...,m, 


and let the polynomials g; have no constant terms. Show that ¢ is tame if 
and only if w is a product of linear and triangular automorphisms without 
constant terms. 


Hint. Apply Exercise 10.1.7. If w is a product of affine and triangular auto- 
morphisms without constant terms, use that the translation 7 : 2; > a; + (; 
is a tame automorphism and derive that @ is tame. If @, is an affine or a 
triangular automorphism and 7, is a translation, show that there exist a 
translation 7. and a linear or a triangular automorphism without constant 
terms y, such that ro ¢, = W107). Hence, if ¢ is tame, then we can decom- 
pose it as @ = Wor = wy 10...0,%07, where each y; 1s a linear or a triangular 
automorphism without constant terms and rT is a translation. 


Exercise 10.1.10 Let G be the subgroup of all augmentation preserving 
automorphisms of K[V;,] and let H be the subgroup of the automorphisms 


w ia; —42;+ terms of higher degree, i= 1,...,m. 


Show that H is a normal subgroup of G and G is a product of GL,(A) and 
H. The automorphisms of H are called sometimes [L-automorphisms (IL = 
Identical Linear component). 


Hint. If 
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m 
wit So apie + terms of higher degree, 7 =1,...,m, 
p=l 


p = (Qpq) € GLm(K), show that pplopope H. 


Exercise 10.1.11 Show that an endomorphism ¢ of K[Vn] (or of K(Vin)) 
is an automorphism if and only if ¢ is surjective, i.e. ¢(K[Vn]) = K[Vn] (or, 


respectively, 6(K (Vin) = K(VYin)). 


Hint. Let w be the augmentation ideal of K[V,] (the ideal of polynomials 
without constant terms) and assume that ¢ is an augmentation preserving 
endomorphism. Show that @ induces a vector space endomorphism @¢, of 
the factors w* /w*t!. Since ¢ is an epimorphism and dimw* /w*t! < oo, we 
obtain that ¢, is an automorphism of w* /w*+?. If h is in the kernel of ¢ and 
hew'\wkt! then ¢4(ht+w*t!) = 0, which is impossible. If ¢ is an arbitrary 
epimorphism, apply Exercise 10.1.7 to reduce the considerations to the case 
when ¢ is augmentation preserving. 


10.2 The Polynomial Algebra in Two Variables 


The main result of this section is the theorem of Jung-Van der Kulk that 
all automorphisms of the polynomial algebra in two variables are tame. The 
theorem was proved by Jung [130] for K = C in 1942 and by Van der Kulk 
[257] for any field in 1953. There are several other proofs based on different 
geometric or combinatorial ideas as those of Engel [100], Gutwirth [122], Na- 
gata [190], Abhyankar and Moh [3], etc. See also the paper by Dicks [56], 
the Ph.D. thesis of Makar-Limanov [178] and the book by Cohn [51]. Some 
of these proofs use essentially that chark = 0 and other work in the gen- 
eral case. Here we give the recent proof of Makar-Limanov [179], which is 
quite transparent and holds for algebraically closed fields of characteristic 
0. With minor additional efforts his considerations can be modified for any 
field of characteristic 0. Since the proof of Makar-Limanov uses locally nilpo- 
tent derivations of K[a, y], we start the section with some preliminaries on 
derivations. Recall (see Exercises 2.1.15 and 3.2.1) that a linear mapping 0 
of a (not necessarily commutative or associative) algebra G is a derivation if 
O(uv) = O(u)v + ud(v) for all u,v € G. As usually, we assume that R is any 
Pl-algebra and F(R) is the corresponding relatively free algebra of rank m. 


Exercise 10.2.1 Show that every map 0: X > F,,(R) can be uniquely 
extended to a derivation of Fy,(R). 


Hint. Define 0 by 
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O(as,... a3, ) “> i, i, Oi, ings Cig: 


Show that 0 is well defined (ie. O(f) = 0 if f = 0). Use that for any mul- 
tihomogeneous f € F(R), we can express O(f) as a sum of polynomials 
obtained by substitutions of O(x;) in the partial linearizations of f. Hence 


f = 0 implies 0(f) = 0. 


Definition 10.2.2 The derivation 0 of the algebra A [Vin], K(Vmn) or Pim (R) 
is called locally nilpotent if for any u in the algebra, there exists a positive 
integer n such that 0"(u) = 0. The derivation is triangular, if O(x;) depends 
ON £j41,---,%m only, t= 1,...,m. 


Exercise 10.2.3 (i) Show that the derivation 0 of K[Vn], K(Vm) or Fin (R) 
is locally nilpotent if and only if it is nilpotent on the generators of the 
algebra. 

(ii) Show that every triangular derivation is locally nilpotent. In particu- 
lar, the usual partial derivative 0/0x,; (defined by 02;/Ox; = Lifi= j and 
0x; /Ox; = 0 if i # j) is locally nilpotent. 


Hint. (i) Use the Leibniz formula 


n! 
n _ n n 
O” (uy... UR) = aol — 7? “(u)...0"* (ug), 
where the summation runs on all nj,...,m, such that ny +...+n, =n. 
Hence 0” (x;, ...2;,) is a linear combination of 0"'(«;,)...0* (a;,) with ny+ 
.. $7, = n. (ii) Use induction on m and show that for n large enough 
O” (a4, ...%;,) does not depend on #1. 


Exercise 10.2.4 (i) Let 0 be a locally nilpotent derivation of A[V,,], and 
let w be in the kernel of 0. Show that the mapping A: u > wA(w) is also a 
locally nilpotent derivation of K[V,,]. 

(ii) Show that the statement of (i) is not always true when we replace 


K[Vm] by K(Vn) or Fim (R). 


Hint. (i) Show that A(uv) = A(u)v +uA(v), u,v © K[Va]. (ii) Consider the 
derivation 0 of K(«,y,z) defined by 0(#) = y, O(y) = O(z) = 0 and w = z. 
Show that A = 20 satisfies 


A(a*) = z(ay + yx), A(a)e + a A(x) = zyx + wzy. 
Exercise 10.2.5 Show that the following mappings induce locally nilpotent 


derivations: 


(i) O(z) = y*, A(y) = 1, for K[z, y], K(x, y) and F2(R); 
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(ii) O(x) = —2yw, A(y) = zw, O(z) = 0, where w = y? + 2x2, for K[z, y, 2]; 

(ili) O(a) = yw, O(y) = zw, O(z) = 0, where w = y? — 2ez, for K[z, y, 2]; 

(iv) O(@) = yw, O(y) = 0, O(z) = wy, where w = xy — yz, for K[x, y, z], 
Ke, y,2) or F3(R); 

(v) O(a) = wz, O(y) = tw, O(z) = A(t) = 0, where w = tx — yz, for 
K[a,y,2,t], (a, y, z,t) or Fy(R). 


Hint. Show that in the case of polynomial algebras O(w) = 0 in (ii)—(v). For 
(relatively) free algebras use Exercise 10.2.3 (i). 


Exercise 10.2.6 Show that the derivations in Exercise 10.2.5 (ii) and (iii) 
are of the form 0 = w@,, where Q; is a locally nilpotent derivation and 
w € Kerd,. 


Hint. In the notation of Exercise 10.2.5 (ii) and (iii), define 0, respectively 
as 


O1(%) = —2y, O1(y) = 2, A(z) = 0, 
O1(@) = y, A(y) = z, A(z) = 9. 
Definition 10.2.7 Let @ be an endomorphism of A[V,,] and let 0/0z, be 


the usual partial derivative with respect to a». The Jacobsan matrix of ¢ is 
the m x m matrix with entries from K[Vn] 


Oo(t1) 9¢(v2) o¢(@m) 
On, On, ste On, 
O¢b(e1 O¢(L2 Od(Lm 
On2 On2 ote On2 
J(o) = 
O¢b(e1 O¢(L2 Od(Lm 
OL m OL m ote OL m 


Warning 10.2.8 In our notation, the (7, 7)-entry of the Jacobian matrix of 
@ is O¢(«;)/Ox;. In commutative algebra one often denotes the endomor- 
phisms by f = (f1,..-, fm), using even capitals: F = (F,..., /), and the 
composition of f with g = (gi,..-;9m) is defined by 


£(9) = (Fi(G1, ++ +59m), +++; fm (915-5 9m))- 


Then it is more convenient to define the (¢,j)-entry of the Jacobian matrix 
J(f) of f as Of;/Ox;. Clearly, replacing our Jacobian matrix with its trans- 
pose would force changes also in other formulas, e.g. in the chain rule in the 
next exercise. 
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Exercise 10.2.9 (i) Show that the Jacobian matrix satisfies the chain rule: 
If é and w are endomorphisms of K[V,,], then 


I(go¥) = J(ejeoT)), 
where $(J()) means that we apply ¢ to the entries of J(«). 

(ii) Show that if ¢ is an automorphism of A[V,,], then J(@) is invertible. 
Hint. (i) Let o(a;) = fi, b(a;) = a, t = 1,...,m. Then (¢ 0 W)(a;) = 
ai(fi, oe fm) and 

A(G ov) (tq) _ Adal fis +s fm) _ 


OL» OL» ~ 


oS Aga fis+- +s dm) Ofek , (IV(xa) \ 96(en) 





k=1 


and this sonplies| the chain rule. 
(ii) Use that the Jacobian matrix of ¢ 0 ¢7! is the identity matrix, i.e. 
J(@) is invertible. 


Now we have the necessary background on derivations and start the proof 
of Makar-Limanov [179] of the theorem of Jung-Van der Kulk. 


Definition 10.2.10 Let 0 be a locally nilpotent derivation of an algebra R. 
We define a degree function deg, on R by 


deg,(u) = min(n | A"(u) = 0) -—1, if OF UER. 
We also assume that degs(0) = —o0. 


For example, if R = K[«] and @ is the usual derivative, we obtain 
the ordinary degree function: to make zero from a polynomial of degree 
3 we should take four derivatives. For any derivation 0, if O(u) # 0 then 


dega(O(u)) = dega(u) — 1. 


Lemma 10.2.11 Jf the algebra R has no zero divisors and if 0 is a locally 
nilpotent derivation of R, then for every u,v © R 

(i) dega(u+ v) < max(deg,(u), dega(w)), 

(ii) dega(uv) = dega(u) + dega(v). 

(iii) If uv £0 and O(uv) = 0, then O(u) = O(v) = 0. 


Proof. The inequality (i) is obvious because O(u + v) = O(u) + O(v). The 
equality (ii) follows from the Leibniz formula 


a" (uv) = > (;) ar—*(w)d* (v). 
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If p = degg(u) and g = dega(v) then 0?*4t!(uv) = 0. On the other hand 
aPt4(uv) = ("* ‘) A (u)A1(v) £0. 

The statement of (iii) follows from (ii) because 
0 = deg,(uv) = dega(u) + dega(v), 


where deg,(u) and degy(v) are nonnegative integers. Therefore 


dega(u) = dega(v) = 0. 


Now we consider the polynomial algebra A’[x, y]. Let 


dude dude 
Ox Oy Oy Ox 
be the Jacobian of (u,v), it is the determinant of the Jacobian matrix. With 


any polynomial f of K[a,y] we associate a derivation as in the following 
lemma. 


Jac(u,v) = Ugly — UyY, = 


Lemma 10.2.12 Let f be a fired element of K[x,y] and let O(v) : K[x,y] 3 
K[z,y] be the mapping defined by 


O(v) = Jac(f,v), v € K[a, yl. 


Then O is a derivation of K[z,y]. If f = o(«) and ¢ is an automorphism of 
K[e, yl, then 0 is locally nilpotent. 


Proof. It is clear from the definition of the Jacobian that for any fixed element 
feKlx,y] 


0 0 
O= tea luge 


is a derivation (see the hint to Exercise 2.1.15). Now, let f = ¢(), g = ¢(y) 
for some automorphism ¢ of K[z,y]. By Exercise 10.2.9 (ii), the Jacobian 
matrix of ¢ is invertible, 0 4 a = det(J(¢)) € K and K[x,y] = K[f,g]. For 
w= f and w = g we obtain that 


O(f) = Jac(f, f) = 0, O(g) = Jac(f,g) =a EK, a°(g) = 0. 


Since the derivation 0 acts nilpotently on the generators f and g of the 
algebra A[f, g] = A[a,y], by Exercise 10.2.3 (i), it is locally nilpotent. 


It is too difficult to describe the polynomials ¢(«), where ¢ is an auto- 
morphism of K[«, y]. Instead we shall describe their top homogeneous com- 
ponents. We fix two positive relatively prime integers p and q and assign to 
x weight p and to y weight gq. In this way, we give K[a,y] the structure of a 
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graded vector space. For every polynomial v(x, y) we denote by @ its leading 
homogeneous component with respect to this grading. If v = v, then v is 
(p, ¢)-homogeneous. 

For example, let p = 2, q = 5 and let 





v(a,y) = 5a®y? — wy? + 2atly t 6xty? — 3at°y. 
Then 5a°y?, —ay® and 2a!'y are of degree 
6x243x5=1x24+5x5=11x241x5= 27, 


6aty? and —3a!°y are respectively of degree 4x 2+2x5=18 and 10x 24 
1 x 5 = 25. Hence 





v(,y) = 5e°y? — xy? + 2a! 


Lemma 10.2.13 (i) Let u,v € K[a,y]. If Jac(u,v) # 0, then Jac(u,v) = 
Jac(%, B). 
(ii) If f is a fixed element of K[x,y] and the derivation of K[x, y] defined 
by O(v) = Jac(f,v), v © K[x, y], ts locally nilpotent, then the derivation 0, 
defined by 
A,(v) = Jac(f,v), v € Kx, yl, 


ts also locally nilpotent. 
Proof. (i) Since 
Jac(aty?, #°y*) = (ad — be)att ety? ttt, 


and Jac(u,v) is a linear combination of Jac(x*y’, z°y“), we obtain that the 
leading homogeneous component of Jac(u,v) is not heavier than Jac(@, ¥) 
(with respect to our (p, ¢)-grading). If Jac(%, ¥) 4 0, then we have the required 
equality. 

(ii) Let O(v) = Jac(f, v) be a locally nilpotent derivation. If u,v are (p, q)- 
homogeneous and Jac(u, v) # 0, then Jac(w, v) is also (p, ¢)-homogeneous and 
a implies that Of (x) is (p, ¢)-homogeneous for all n and 0"(«#) = OP (x) if 

dP («) # 0. Since O"(#) = 0 for some n, we obtain that Of(«) = 0 for some 
ke <n. Similar arguments for y give that d\(y) = 0 for some J. Hence Q) is 
locally nilpotent by Exercise 10.2.3 (i). 





Lemma 10.2.14 /f f(z,y) © K[z,y] ts (p,q)-homogeneous then f can be 
decomposed in the form 


k 
f(e,y) =axy? [](e1- By’), OF EK, 2 EK, a,b, k > 0. 
ec=l 


Proof. We write f(x,y) in the form 
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f(a, y) = arty? (a + ayaty® +... + appa ty! + any”), 


a,a; € K, a,ax # 0. Since f is (p, g)-homogeneous, there exists a d > 0 such 
that 
d = pag = pay + qby =... = pag-1t qbp—-1 = bg. 


Since p and q are relatively prime, all ao, a@1,...,@%—1 are divisible by q. 
Similarly all 5;,...,8, are divisible by p. Hence 


f(x,y) = ety? (8° 4ayst™ 4... + ag_ps te? + apt”), 
where s = «7,t = y” and 


eco= ey tdy=...=eg-ptdg_-1 = dy = 


2 (8 


om" 


Since the field is algebraically closed, we can decompose f(x, y) as desired. 
In most of the proofs of the theorem of Jung-Van der Kulk one needs 

to show that the leading components of ¢(x) and ¢(y) with respect to some 

grading of K [x,y] have a nice form for any automorphism ¢ of K[z, y]. 


Lemma 10.2.15 Let ¢ € AutK |x, y] and let p,q be relatively prime positive 
integers. Then the leading component (x) of @(a) with respect to the (p, q)- 
grading of K[x,y] has the form az’, ay! or a(x?—By?)*, for some a, BE K, 


a#0. 


Proof. By Lemma 10.2.14, 





k 


d(a) = ony? T] (x? - Bey"), OF OE K, EK. 


c=l 





We consider the derivations 0 and 0, of K[a, y] defined by 
A(v) = Jac(o(z), v), 1 (v) = Jac(o(z), v), VE K[z, yl. 


Since 0; (¢(x)) = 0, according to Lemma 10.2.11 (iii), all factors of 4(x) are 
O\-constants. If we have two essentially different (algebraically independent) 
factors, say #7 — By? and «x? — Boy”, then O)(@2) = 0, (y?) = 0. Applying 
once again Lemma 10.2.11 (iii), we obtain that 0)(#) = 0,(y) = 0 and, hence, 
0, = 0. This is impossible, because 0; is not the zero derivation. 





Lemma 10.2.16 Let ¢ € AutK [x,y]. If the leading component ¢(a) of ¢(x) 
with respect to the (p,q)-grading of K[x,y] has the form a(x% — By?)*, and 
B#0, then p=1 org=1. 

Proof. If ¢(x) = (2% — By?)* then, by Lemmas 10.2.12 and 10.2.13 (ii), the 
derivation 0; of K[zx, y] defined by 0,(v) = Jac((x4— By’)*, v), v € K[z, y}, is 


10.2 The Polynomial Algebra in Two Variables 163 


locally nilpotent. Let 02 be the derivation defined by 02(v) = Jac(%?—y", v), 
v € K[z,y]. Then x? — By? € Kerdy and 0; = k(x? — By? )’-10,. Hence, as 
in Exercise 10.2.4 (i), 


P= (k(2t — ByP)*~")" 5 


and 0, is also locally nilpotent. Now 02(#) = Bpy’—+ and do(y) = qxt!. If 
we consider the degree function deg = dega, relative to 02 (see Definition 


10.2.10) and denote by d, = deg(z) and dy = deg(y) then 
dy —1= (p—1)dy, dy —1 = (q- Dade. 





Hence —2 = (¢ — 2)d, + (p — 2)dy. Since p and ¢ are positive integers, it is 
possible only if p or g is equal to 1. 


Now we prove the theorem which describes the automorphisms of K[#, y]. 


Theorem 10.2.17 (Jung [130] and Van der Kulk [257]) Every automorphism 
of K[x, y] ts tame. 


Proof. Let @ € AutK[z, y]. As usually, we denote by u, and u, the partial 
derivatives of u with respect to z and y. We apply induction on the product 
deg,o(x) - deg, (x). The base of the induction is when this product is 0, ie. 
#(“) does not depend on one of the variables x and y. If d(«) depends only 
on x, then 


Jac(¢(a), d(y)) = o(z)eb(y)y 


is a nonzero constant. Therefore ¢(x), and $(y)y are also nonzero constants 
and 


g(x) = AL + B, o(y) =7Y + p(x), a, 8,7 € K,a,y # 0, p(#) € K{a]. 


Hence ¢ is a triangular automorphism. If ¢(«) depends only on y, then 


o(4) =ay+ 6, o(y) =ye4+ply), a,8,7 € K,a,yF 0, ply) € Ay]. 


In this case ¢ = wor, where ~ is a triangular automorphism and 7 is the 
automorphism defined by r(x) = y, T(y) = #, i.e. 7 changes the places of x 
and y. Hence, in both the cases, ¢ is tame. Now, let ¢(a) depend on both x 
and y. Let 








deg, @(z) =a > 0, deg, d(x) = 6 > 0. 


We choose relatively prime positive integers p and q such that pa = q6 and in- 
troduce the (p, q)-grading on K[a, y]. Clearly, the leading (p, g)-homogeneous 
component ¢(«) of ¢(a) depends on both « and y. (Why? Answer: If the (p, ¢)- 
degree of ¢(x) is equal to ap, then ¢(#) contains as a summand the monomial 
of ¢(a) of maximal degree a in x as well as the monomial of maximal degree 6 
in y. If the (p, g)-degree of ¢(a) is bigger than ap, then each monomial of ¢(x) 
depends both on « and y.) By Lemma 10.2.16, p = 1 or gq = 1 and ¢(x) is 
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either of the form a(y—x7)* (when p = 1) or of the form a(x — By? )* (when 
q = 1), in both the cases 0 4 a, 8 € K. If (xz) = a(y—x2)*, let us compose 
¢@ with the tame automorphism ~ defined by (x) = x, o(y) = y+ 6x". Then 


(0 ¢)(x) = ay*. Now 
deg, ( 0 4) (a) < deg, (x), deg, (v © 9)(x) = deg, 9(x) 


(why?). Similarly, if ¢(7) = a(« — Gy’)*, then we compose ¢ with the tame 
automorphism w given by W(x) = x + By’, u(y) = y. Now (Wo ¢)(x) = azx* 


and 
deg, (v 0 $)(x) = deg, o(x), deg, (a 0) (a) < deg, o(a). 


By inductive arguments, the automorphism 7 o ¢ is tame and, since ¢ is also 
tame, we obtain the tameness of ¢. 


The group AutA[, y] has a description in terms of combinatorial group 
theory which follows from the careful study of the proof of Van der Kulk [257]. 
A version of the proof can be found in the book by Cohn [51] or in the paper 
by Dicks [56]. See also the important paper by Wright [267] which contains 
also results for the automorphism group of the C-algebra AutC[z, y], where 
C' is a commutative domain. 


Definition 10.2.18 Let G, and G2 be two groups, containing respectively 
subgroups H, and H» isomorphic to a given group H. The free product of 
Gy, and Gy amalgamating H is the group G, *4 G2 with generating set con- 
sisting on the disjoint union of generating sets of G; and Ge, with defining 
relations consisting of the relations of G1, Gz and of relations identifying the 
corresponding elements of H, and Hy. In other words, if G; and Gz are gen- 
erated respectively by {g, |p € P} and {gj | ¢ € Q} with defining relations 
respectively Ry and Re, and y; : H > H; is the isomorphism of H and H;, 
? = 1,2, then G, *y G2 is generated by {g,,97 | p © P,g € Q} and has 
defining relations Ry U RyU {i (h) = ve(h) | h € A}. 


Example 10.2.19 Let G; = (x | x* = 1) be the cyclic group of order 4, 
let Go = (y,z | y? = 2° = yzyz = 1) be the symmetric group $3, where we 
identify y with a transposition and z with a cycle of length 3. Let H be the 
cyclic group of order 2 embedded into G; and G2 by H, = (27), Hz = (y). 
Then G #4 Gz is the group generated by x,y,z (the generators of G; and 
G2) and with defining relations 


a’ =1 (relations of Gi), y*? = 2° = yzyz = 1 (relations of G.), 
y=" (identification of the elements of H, and Hz). 


For example, we calculate the element eyzyzya° zz of G,*4 G2. First, working 
in Gy we present yzyzy as (yzyz)y = y and 


ayzyzye ze = x(yzyzy)e% ze = eye? ze. 
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Using that y = a7, we see that 


ayer ca = aaa ze = ese = area. 


For the properties of amalgamated free products of groups see e.g. the 
book by Magnus, Karrass and Solitar [176] (or, for the minimum needed 
to read the proof of the next theorem, the paper by Wright [267]). The 
combinatorial description of Aut A [a, y] mentioned above is the following. 


Theorem 10.2.20 The group AutK[2, y] is the amalgamated free product of 
the group of affine automorphisms and the group of triangular automorphisms 
over their intersection with the triangular affine automorphisms. In other 
words, if G, ts the affine group of the two-dimensional vector space with basis 
{x,y} and Go is the group of triangular automorphisms (a, a+f(y), a2y+(2), 
O04 a1,02E K, Bo EK, f(y) € K[y], and H is the intersection of these two 
groups, 


H={(a.e By t Y,2Y G2) | 04 a1,a2 € K, 1, Bo,y € K}, 


then Aut K [x,y] = Gi #9 Go. 





An equivalent form of this theorem in the language of IL-automorphisms 
is given by Shpilrain and Yu [242]. We give a weaker version of their result. 


Theorem 10.2.21 (Shpilrain and Yu [242]) Let G be the group of IL- 
automorphisms of K[x,y], t.e. the automorphisms ¢@ of K[x,y] such that 


d(x) =at+..., Oy) =yt..., 


where ... means a sum of monomials of higher degree. Considering G as 
a subgroup of the group Aut? K[x, y] of augmentation preserving automor- 
phisms, G is the normal closure in Aut? K[z, y] of the subgroup H generated 
by the elementary automorphisms 


Or = (a+ f(y), ¥), Wg = (z,yt+9(z)), 


where f(y) and g(a) are polynomials without constant and linear terms. The 
group H is isomorphic to the free product of the abelian groups 


He = {os = (4+ f(y), y) © GY, Hy = tg = (2, y + 9(#)) € GY. 


In other words, every automorphism y in H can be uniquely presented as a 
product 


N= OF Og OOF, Og, 0---0Gf, OR, & =0,1,7= 1,2, 


and every automorphism 6 in G has a presentation 
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g= (o1mo7) (can ') a (opMpo,)s ny © H,o; € Aut’ K[a, y]. 


Exercise 10.2.22 Show that the groups H, and Hy in Theorem 10.2.21 
are “naturally” isomorphic to the additive groups of all polynomials in one 
variable without constant and linear terms. 


Hint. Show that the composition of (a+ fi(y),y) and (« + fo(y), y) is equal 
to (x + fily) + foly),y)- 


10.3 The Free Associative Algebra of Rank Two 


In this section we shall prove the theorem of Makar-Limanov [177] and Czer- 
niakiewicz [53] describing the automorphisms of K(x, y). Proofs can be also 
found in the book of Cohn [51] or in the paper of Dicks [57]. Our proof is 
based on the proof in [51]. 


Definition 10.3.1 Let p,q be two integers, not both equal to zero. 

(i) We define a (p,q)-bidegree D : K(x, y) 4 Z?U {-oo} assuming that 
the bidegree of % is equal to (1,0), the bidegree of y is (0,1) and D(0) = —ov. 
(Hence D is the usual bidegree on A (a, y). A monomial u is of (p, ¢)-bidegree 
equal to (a1, 42) if « appears a; times in u and y appears ay times.) 

(ii) We introduce a linear ordering on Z?: If (a1, a2), (61,62) € Z?, then 
(a1, d2) ~ (b1, 62) if and only if ayptaaq < bip+beq or, if a;pt+aoq = biptbag 
then ayg < 61g. 

(iii) For the polynomial f(#,y) € K(x, y) we denote by supp(f) the set 
of all points (a1, a2) in (NU {0}) x (NU {0}) such that f contains monomials 
of (p, q)-bidegree (a1, a2). The <-leading component f of f is the sum of all 
monomials of f which are of maximal degree with respect to <. 

(iv) We also define a (p,q)-degree function d : K(x,y) —~ ZU {-co} 
assuming that d(#) = p, d(y) = q. 


Example 10.3.2 Let p = 3, ¢ = —2. Then 
(2,4) ~ (5,6) ~ (3,3), 
because 
2-344-(-2) = -2 <5-346-(-2)=3-343.-(-2) = 38, 5-(-2) < 3- (2). 


Hence, for 





f = 2yx7yry — a? ya — wt yl ex t+ yey? + Qeyx7y’ 


the <-leading component is 
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f = 2yx7 yxy — v7 yx + Qeyx7y’. 


Exercise 10.3.3 Show that the homomorphism of additive groups 

\:Z? —+Z,X: (a1, 42) > arp + aaq, 
preserves the grading of Definition 10.3.1 (ii), i.e. if a = (a1, a2), b = (b1, be) 
and a < 6, then A(a) < A(8). 


Exercise 10.3.4 Let p and q be integers, not both equal to zero, and let 
f and g be two polynomials in K(x, y) such that their leading components 
f and 7 with respect to the (p, q)-bigrading are independent, i.e. generate a 
free subalgebra of A(x, y). Show that f and g generate a free subalgebra of 
K(x, y), the leading terms of the polynomials in K(f, g) are contained in the 
free subalgebra generated by f and 7 and 


{D(h) [he K(f, 9)} C (NU {0}) - DUP) + (NU {0}) - D(g) U {00}. 


Hint. Use that the (p, ¢)-bigrading of K(x, y) satisfies the conditions 
D(u)D(v) = D(uv), D(wt+ v) X max(D(u), D(v)), u,v € Ka, y). 
Derive from here the statement of the exercise. 


Lemma 10.3.5 Let f,g © K(x, y) be homogeneous with respect to the (p, q)- 
bigrading of K(x, y). Then either f and g generate a free subalgebra of K(«, y) 
or, up to multiplicative constants, they both are powers of the same homoge- 
neous element of K(x, y). 


Proof. Without loss of generality we may assume that f and g are not con- 
stants in the field K. Since the (p, q)-homogeneous elements of K(x, y) are 
multihomogeneous in the usual sense, we obtain that f(0,0) = g(0,0) = 0. 
Let f and g be dependent, i.e. they do not generate a free algebra. Therefore 
h(f,g) = 0 for some polynomial 0 4 A(t, u) € A(t, u). Then 


0 = h(f(0, 0), 9(0,0)) = ACO, 0). 
Hence A(t, uw) has no constant term, A(t, uv) has the form 
A(t, u) = thy (t, wu) + uho(t, wu), 
where Aj(t, u), ho(t,u) © A(t, u), and 
h(f,g) = Fai (fg) + gha(f,g) = 0. 


Let the length of the monomials of f be bigger than or equal to the length 
of the monomials of g. Then f = gf; for some f; € K(x,y). (This fact is 
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not trivial. If some problems arise, see Section 6.7 of the book by Cohn [51].) 
Hence 


0=A(f,9) =9fihilgh.g) + ghel(gfi,g) = gh'(fi,g)- 


Since the free algebra A (a, y) has no zero divisors, we obtain that h’(f,g) = 0 
and, by induction, f; = av’, g = Bu', where a, 3 € K and v is a homogeneous 
polynomial in K(x, y). 


Proposition 10.3.6 Let f and g be two polynomials of K(a,y) such that f 
is congruent to x and g is congruent to y modulo the commutator ideal of 
Kia, y). Let f #« org #y. Then there exist two integers p and q, not both 
equal to zero, such thatp <0 <q, 


Df) + Dg) # D(x) + Dy) = (LY) 


in the (p, q)-bigrading of K(x, y), 


(d(f), d(g)) = (d(x), d(y)) = (P, 4) 


in the (p,q)-grading and f and J are independent. Moreover, ¢ = (f,g) is 
not an automorphism of K(x, y) and [f,g] 4 a[z, y], for all nonzero constants 
ae Kk. 


Proof. Let supp(f) be the set of all pairs (a1,a2) such that there exists a 
monomial of f of degree a; and az, respectively to x and to y. Since 


f = 2+ So ajuj[x, ye, I= yt S > Bj uile, yey, 


for some a;, 9; € K and some monomials u;, vj, uj, vj € K(2,y), all mono- 


mials of f contain x and all monomials of g contain y. Let us denote 
supp(«~*f) = {(a1 — 1, a2) | (a1, a2) € supp(f)}, 


supp(y~'g) = {(b1, b2 — 1) | (b1, 62) € supp(g)}. 


Clearly, the union S of supp(f) and supp(g), with the origin removed, lies 
in the first quadrant. If S contains a point (0,p2) with po #4 0, we choose 
p = —ps, q = 0. If S contains no such a point, we choose (p1,p2) € S, 
(pi, p2) # (0,0), such that the quotient po/p; is maximal and put p = —pa, 
q=Pi- 

Now we consider the ordering < induced by p and q. Let us assume that 
q > 0, the case g = 0 is similar. By the choice of p and gq, for a point (a1, a2) 
of S with ag/a; < po/p, we obtain that —a,po + aop1 = ayp+asq < 0 
and (a1,@2) < (0,0). If a2/a1 = po/pi, then a2qg > 0 and (0,0) ~ (a1, a2). 
Hence all “positive” points of S lie on the half-line L from the origin (0,0) 
through (p1,p2) = (¢,-p). Now supp(x7!f) contains the origin, which is 
also on L. Hence the (p,q)-bidegrees of f and x satisfy D(x) < D(f) and 
D(f) — D(a) © L. Similarly, D(g) — D(y) © L. By the choice of p and 
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q, at least one of D(f) — D(x) and D(g) — D(y) is different from (0,0). 
Hence D(x) + D(y) ~ D(f) + D(g). Besides, for the (p,q)-degree we obtain 
d(f) — d(a) = 0 (because D(f) — D(x) € L), similarly d(g) — d(y) = 0, ie. 
(d(f), d(g)) = (d(“), d(y)) = (p,q). If the leading components f and 7 are 
dependent, then by Lemma 10.3.5, up to multiplicative constants, they are 
positive powers of a homogeneous element v of A (a, y). Hence both d(f) and 
d(g) are positive multiples of d(v) which is impossible, because d(f) = d(x) = 
p<0<q=d(y) =d(g). Therefore f and 7 are independent. 

Let ¢ = (f,g) be an automorphism of K(z,y). Then f and g generate 
K(«,y) and «,y € K(f,g). Hence 


D(x) = (1,9), Dy) = (0, 1) € {D(u) | we ACF, 9)}. 
Since f and 7 are independent, by Exercise 10.3.4 we obtain that 
N* € {D(u) | we K(f,g)} C (NU {0}) - DF) + (NU {0}) - D(g) U {cv}. 


But this is impossible, because (1,1) <~ D(f) + D(g) implies {D(f), D(g)} F 
{D(«), D(y)} and (NU {0})-D(f) + (NU {0})- D(g) does not cover the whole 
first quadrant. Hence ¢ = (f,g) is not an automorphism. 

Finally, since f and 7 are independent, 


fo = SG #GS = 9f 
and hence we obtain for the (p, q)-bigrading that 
D(If,g]) = max(D(fg), D(gf)) = D(f) + Dig) > D(z) + DY) = Dla, yl). 
This means that [f,g] # a[z,y] for OA ae K. 


The following theorem is the main result of the section. It can be viewed 
as a noncommutative analogue of Jung-Van der Kulk Theorem 10.2.17. 


Theorem 10.3.7 (Makar-Limanov [177] and Czerniakiewicz [53]) All auto- 
morphisms of the free algebra K(x, y) are tame. 


Proof. There exists a natural homomorphism 
nm: AutK (x,y) —> AutK[z, yJ. 


For every automorphism ¢ of A (x,y) the automorphism a(¢) of A[x, y] 
sends « and y respectively to ¢(a) and ¢(y) modulo the commutator ideal 
of K(x, y). Every affine or triangular automorphism of K[x, y] can be obvi- 
ously lifted to an affine or triangular automorphism of K(x, y). By Theorem 
10.2.17, every automorphism of K[2, y] is tame, i.e. a product of affine and 
triangular automorphisms. Hence the homomorphism 7a is an epimorphism 
and every automorphism of K[2, y] is induced by some automorphism of the 
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free algebra K(x, y). We shall prove the theorem if we establish that the 
kernel of 7 is trivial. Let ¢ = (f,g) € Kerm. Hence, 


da) = f=etS ajule,ylui, oy) =9 =o t+ >, Guile, yl}, 


where a;, 9; € K, uj, vi, u,v; € K(2,y). By Proposition 10.3.6, if f # x or 
if g # y, then # cannot be an automorphism. Hence, the kernel of ¢ is trivial 
and every automorphism of K(x,y) is a product of affine and triangular 


automorphisms. 


Corollary 10.3.8 The groups AutK (x,y) and AutK[s, y] are isomorphic in 
a canonical way. 


Proof. In the notation of the proof of Theorem 10.3.7, the group homomor- 
phism 
nm: AutK (a, y) — AutK[x, y] 


is an epimorphism with trivial kernel, i.e. 7 is an isomorphism. 


It is easy to recognize whether an endomorphism of K(x, y) is an auto- 
morphism since there is a simple commutator test. 


Theorem 10.3.9 (Dicks [55], Czerniakiewicz [53]) An endomorphism ¢ = 
(f,9) of K(«,y) ts an automorphism if and only if there exists a nonzero 
constant a € kK such that 


O(a, 4) = [fg] = ole, yl. 


The next exercise and Proposition 10.3.6 give some idea about the proof 
of Theorem 10.3.9, although the complete proof needs additional essential 
work. The proof can be found in the paper by Dicks [55] or in the book by 
Cohn [51]. 


Exercise 10.3.10 Show that for every automorphism @ of K(«,y) there 
exists a nonzero constant a € K such that 


o([z, yl) = a[z, yl. 


Hint. For an affine or for a triangular automorphism ¢ of A (a, y) it is easy 
to see that ¢([, y]) = a[x, y], 0 4 a € K. Hence every automorphism, as a 
product of affine and triangular automorphisms, satisfies the same property 
for some a € K. 


Concerning the automorphisms of the free associative algebra K (Vi), the 
situation is similar to that for the automorphisms of the polynomial algebra 
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but the study in the noncommutative case seems to be less intensive than in 
the commutative. The problem whether all automorphisms are tame is still 
open for m > 2. One of the main difficulties is that the proof of Theorem 
10.3.7 is based on the proof of Theorem 10.2.17 and we do not know the 
structure of Aut K[V,,] for m > 2. 


10.4 Exponential Automorphisms 


In this section we consider a class of automorphisms which appears naturally 
also in other algebraic considerations. We assume that the characteristic of 
K is equal to 0 and denote by R any Pl-algebra. 


Exercise 10.4.1 If 0 isa locally nilpotent derivation, show that the mapping 
of K[Vin], K(Ym) or Fin (R) 











(expd)(u) = u4 1 


is well defined and is an automorphism of the algebra. 


Hint. Use that the high powers 0” annihilate u and hence expd is well de- 
fined. Use the Leibniz formula 


nm 


a"(ur) = > (;) ak (ujar-* (v) 


to show that (expQ)(uv) = (expd)(u)(expd)(v). 


Definition 10.4.2 If 0 is a locally nilpotent derivation of K[Vin], K(Vm) or 
Fy,(R), we call expd the exponential automorphism induced by 0. 


Exercise 10.4.3 (see e.g. Daigle and Freudenburg [54]) Show that u is in 
the kernel of the locally nilpotent derivation 0 if and only if (exp@)(u) = w. 


Hint. Use Vandermonde arguments for 


A(u) 
l 


u = exp” O(u) = exp(nd)(u) = ut n+ 2 a (u) 


2! 
and derive that exp” 0(u) = u implies that O(u) = 0. 





+...,n=0,1,2,... 


In 1970 Nagata [190] gave his famous example of an automorphism of 
K[e, y, z] which fixes the variable z and is not tame considered as an auto- 
morphism of the K[z]-algebra (A [z])[«, y]. The automorphism of Nagata can 
be obtained in the following way. 
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Exercise 10.4.4 Let 0 and A be derivations of K[x, y, z] defined by 
A(x) = —2y, A(y) = z, A(z) =9, 
A(x) = —2yw, Aly) = zw, A(z) =0, w= ye tee. 


Show that 0 and A = w@ are locally nilpotent and expA is the automorphism 
of K[«, y, z] defined as 


y= (x — 2y(y? + ez) — 2(y? +z), yt 2(y? + #2), 2). 





This is the Nagata automorphism. 
Hint. Use Exercises 10.2.5, 10.2.6 and 10.4.3. 


Exercise 10.4.5 Let 





by = (w@ — 2vy —v7z, y+ vz, z),ve Ke, y, 2], 


be an endomorphism of K[a, y, z]. Show that ¢, is an automorphism if and 
only if v = vi (y? +2, 2), for some v1 = v1(t1, 2) € K[t1, t2]. In this way one 
obtains a family of “Nagata like” automorphisms. 


Hint. Use the previous Exercise 10.4.4. If in the notation there v € Kerd, then 
gy = exp(v@) is an automorphism. If ¢, is an automorphism, its Jacobian 
matrix is invertible and its determinant is a nonzero scalar in K. Calculate 
the determinant of the Jacobian matrix of ¢,: 


det(J(¢v)) = 1+ 2yve + zuy = 1, 


where v, and vy are the partial derivatives of v. Show that the only solu- 
tions in K[a,y, z] of the partial differential equation 2yv, + zvy = 0 are the 
polynomials v1 (y? + xz, z), where vi(t1,t2) € K[t1, te]. 


Conjecture 10.4.6 (Nagata [190]) The Nagata automorphism of K’[, y, 2] 
is wild. 


There exist many evidences that the behaviour of the Nagata automor- 
phism is different from this of the most tame automorphisms. See for example 
the book by Nagata [190] and the papers of Alev [8], Drensky, Gutierrez and 
Yu [90] and Le Bruyn [170]. 


Definition 10.4.7 An automorphism ¢ of K[Vi,] (respectively of K(Vin) or 
of Fin (R) for some Pl-algebra R) is called stably tame if there exists a positive 
integer p such that the extension of @ to K[Vn4p] (respectively to K(Vintp) 
or to Fin4p(R)) by O(@m4e) = mtn, = 1,...,p, is a tame automorphism. 
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Theorem 10.4.8 (Martha Smith [244]) If 0 is a triangular derivation of 
K[Vin] and w is in the kernel of 0, then the exponential automorphism 
exp(w0) is stably tame. 


Proof. We extend the derivation 0 to K[Vin4i] by O(am41) = 0. Clearly, 
@ is still locally nilpotent, tm41 € Kerd and (prove it!) exp(am4i0) is a 
triangular automorphism of K[Vin41]. Let 7 be the triangular automorphism 


of K[Vin4i] defined by 
T(em41) = tmqi tw, T(a;) = @;,¢=1,...,m. 


Concrete calculations show that 


1 


exp(w0) = T° o exp(—#@m410) 0 7 0 exp(%m410) 


and exp(w@) is a tame automorphism of K[Vin41]. Since exp(w0)(am4i) = 
m4+1; the automorphism exp(w0) of K[V,,] is stably tame. 


Corollary 10.4.9 (Martha Smith [244]) The Nagata automorphism is stably 
tame. 


Proof. By Exercise 10.4.4, the Nagata automorphism is equal to exp(w@), 
where 


A(x) = —2y, Oy) = z, A(z) =0, way? + ez. 
Clearly, 0 is a triangular derivation, w € Kerd and we apply Theorem 10.4.8. 


From some point of view, all known automorphisms of the polynomial 
algebra arise from natural constructions and are “nice looking”. Now we can 
make Problem 1.1.1 precise in the following way. 


Problem 10.4.10 Is the automorphism group of K[Vj,], m > 2, generated 
by: 

(i) automorphisms exp(A), where A = wd, w € Kerd and 0 is a triangular 
derivation; 

(ii) automorphisms exp(A), where A is a locally nilpotent derivation; 

(iii) stably tame automorphisms; 

(iv) other natural automorphisms? 


The above discussion suggests the following question. 


Problem 10.4.11 (Bergman, private communication) Can every nilpotent 
derivation of K[V,,] be obtained by starting with a locally nilpotent derivation 
O of some particular simple sort, multiplying it by a member w from the kernel 
of 0 and conjugating by an automorphism of K[V,,]? 
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Remark 10.4.12 It is interesting to have more examples of naturally arising 
Nagata like automorphisms. This would provide new potential candidates for 
wild automorphisms of A[V,,]. Some wild automorphisms of the C-algebra 
Cl@1,...,@m], where C' is any integral domain, are given in the book by 
Nagata [190] and in the paper by Wright [267]. Up till now, all stably tame 
automorphisms of A[V;,] have been found using locally nilpotent derivations. 
For example, recently Van den Essen [103] and Drensky and Stefanov [97] 
have constructed a family of exponential automorphisms. The proof that the 
automorphisms in [103] and [97] are stably tame is similar to that of Theorem 
10.4.8, but instead of triangular derivations uses the theorem of Suslin [248] 
that for k > 3 every invertible k x k matrix with entries from A[V,,] is a 
product of elementary matrices. We recommend the book by Nowicki [196] on 
derivations of polynomial algebras where one can also find explicit generators 
of the kernels of concrete locally nilpotent derivations. 


The following exercise is a partial case of a result proved by several authors 
for polynomial algebras (van den Essen [101], Shannon and Sweedler [235] 
and Abhyankar and Li [2]) and by Drensky, Gutierrez and Yu [90] in the 
noncommutative case. 


Exercise 10.4.13 Let Fa,,(R) be a relatively free algebra with generators 
L1,-+2; Lm; Yly-+-) Ym and let 


o= (fi(X),.--5 fm(X)), v= (91(X),---,9m(X)) 


be endomorphisms of the subalgebra F,,() generated by x1,...,@m. Then 
é and z are automorphisms of F,,(R) with ~ = $71 if and only if the ideals 
U and V of Fam(R) coincide, where 


U=(a;—-fi(Y) |t=1,....m), V = (wy — a (X) |i =1,...,m). 


Hint. Work modulo the ideals U and V. If @ and w are automorphisms and 
w= 71, then use that 


filgi(X),-.-,9m(X)) = wo o(a;) = Xj, t= 1,...,m, 


and derive that U C V. If U = V, use that in Fon(R) = Foan(R)/U = 
Fap(R)/V - - 

& = fi(ti,---, Ym) = filM(X),---,Gm(X)) 
and wo @(a;) = #;, i.e. the composition 7 o ¢ induces the identity map on 
Fin (R). Besides, the subalgebra generated by %1,...,£m is isomorphic to 
Fy,(R). Hence Ker¢ = (0), Im) = Fi, (R). Similarly, Imé = F(R) and ¢ is 
an automorphism. If difficulties appear, see the paper of Drensky, Gutierrez 


and Yu [90]. 


Remark 10.4.14 In the case of the polynomial algebra K[V,,], if 
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0 = (fi(X),--- fm(X)) 


is an endomorphism of A[Vj,], there exists a good algorithm based on 
Grobner bases techniques, which allows to solve effectively the problem 
whether the ideal 


(x4 —filY),...,@m — fm(Y)) <A lai, -.-5 m5 Y1,---) Ym] 


has a generating set of the form 


(yi — 91(X),---5Y%m — Gm(X)), 


1.e. to determine whether ¢ is an automorphism and to find its inverse. For 
details see some book on Grébner bases (e.g. by Adams and Loustaunau 
[4]) or the papers of van den Essen [101], Shannon and Sweedler [235] or 
Abhyankar and Li [2]. 


We have already seen in Exercise 10.2.9 that the Jacobian matrix of an 
automorphism of the polynomial algebra A[V,,] is invertible. On the other 
hand, the inverse function theorem in calculus states that if the Jacobian 
matrix of a mapping IR” — R”™ is invertible, then the mapping is locally 
invertible. The analogue of this theorem for the polynomial algebra gives 
that if the Jacobian matrix of an augmentation preserving endomorphism is 
invertible, then the endomorphism induces an automorphism of the algebra 
of the formal power series. The famous Jacobian conjecture is the following. 


Jacobian Conjecture 10.4.15 If chark = 0, then an endomorphism ¢ of 
the polynomial algebra K[Vj,] = K[a1,...,2%m] is an automorphism if and 
only if the Jacobian matrix J(¢) of ¢ is invertible. 


A lot of work has been done on the problem (see e.g. the book by Ab- 
hyankar [1] and the survey article of Bass, Connell and Wright [24]) but the 
answer is still unknown even for m = 2. 


Exercise 10.4.16 Show that the condition chark = 0 in the Jacobian con- 
jecture is essential. 


Hint. Show that for p prime, the Jacobian matrix of @ € EndZ,[x], where 
é(“) = «+ 2?, is invertible and ¢ is not an automorphism. 


Remark 10.4.17 There are many generalizations of the partial derivatives 
0/0x; in the case of free and relatively free algebras and one can define the 
Jacobian matrix of dé € EndA(V,,) and ¢ € EndF,,, (2). Usually this Jacobian 
matrix satisfies the chain rule of Exercise 10.2.9 and one may ask an analogue 
of the Jacobian conjecture for free and relatively free algebras. There are also 
some other analogues of the Jacobian matrix, which are endomorphisms and 
not matrices. For details for A (Vj) see the paper of Dicks and Lewin [59] 


176 10. Automorphisms of Free Algebras 


and the book by Schofield [234]. For an exposition on the general case see 
the survey article by Drensky [81]. 


10.5 Automorphisms of Relatively Free Algebras 


Till the end of the chapter we shall shortly discuss automorphisms of rela- 
tively free algebras. We start with the theorem of Bergman [34] which gives 
a new class of automorphisms of relatively free algebras and shows that im- 
portant relatively free algebras of rank 2 have wild automorphisms. 


Theorem 10.5.1 (Bergman [84]) Let P2(Mj,(K)) be the relatively free al- 
gebra of rank 2 in the variety generated by the k x k matrix algebra, 
k > 1 (i.e. Fo(My(K)) is the algebra of two generic k x k matrices). 
Let u(x,y) # 0 be a polynomial without constant term, in the centre of 
F5(M,(K)) and such that all variables participate in commutators only (i.e. 
u(,y) © Bo/(T(M;(K))N Bo) in the notation of Definition 4.3.1). Then the 
endomorphism of F2(M,(K)) 


bu = (@ + ule, y), y) 


ts a wild automorphism. 


Proof. We consider the canonical algebra homomorphisms 
Ka, y) — Fo(Mi(K)) — K[e, y] 
which induce group homomorphisms 
Aut K(x, y) —> Aut fo(M,(K)) — AutK[a, y]. 
By Corollary 10.3.8, the homomorphism 
AutK(«, y) — AutK[x, y] 

is an isomorphism. This implies that 

Aut K(x, y) —> Aut Fo(M;(K)) 
1s a monomorphism and 

Aut Py(M,(K)) — AutK[s, y] 


is an epimorphism. We shall prove the theorem if we find a nontrivial au- 
tomorphism of F2(M2(K)), and it will be ¢,, which is in the kernel of the 
homomorphism 

Aut P)(M;,(K)) — Autk[a, y]. 
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Let v(#,y) be another element of Ff2(M;,(K)) with the same properties as 
u. Since all variables participate in v(#,y) in commutators only, and u is a 
central element in F)(M;,(A)), we obtain that 


v(x + u(x,y),y) = v(x, y), 
bu © Gy (@) = bulw + v(x, y)) = bulw) + v(bu(), duly) = 
=x+u(x,y) +(x t u(z,y),y) = 2+ ulx,y) + v(@,y) = bute(x), 
duo bu(y) = Guly) = y = butu(y) 


and ¢y ody = duty. Hence ¢y is invertible with @7! = b(-u). Clearly, ou 
induces on K[a,y] the identity map and this completes the proof of the 
theorem. 


Example 10.5.2 Let & = 2 and let u = [x, y]?. We know that u is in the 
centre of the algebra of two generic 2 x 2 matrices (see Exercise 7.3.2). By 
Bergman Theorem 10.5.1, 


gu = (x + [x,y], y) 


is a wild automorphism. 


Remark 10.5.3 The algebra of two generic 2 x 2 matrices has an important 
property which does not hold for other algebras of generic matrices. We refer 
to the book by Formanek [108] or to the paper by Alev and Le Bruyn [9]. We 
shall only mention the following. If x and y are two generic 2 x 2 matrices 
and we consider the commutative algebra C’ generated by 


tr(z), tr(y), det(x), det(y), tr(xy), 


then the algebra C' is isomorphic to the polynomial algebra in five commuting 
variables. The automorphisms ¢, in Theorem 10.5.1 induce automorphisms 
of C' and there was a hope that some “strange” automorphisms of the generic 
matrix algebra will induce “strange” automorphisms of the polynomial alge- 
bra. It turns out that in this way one can induce for example the Nagata 
automorphism in Exercise 10.4.4 (but of the algebra of 5 variables, where it 
is tame) and many other interseting automorphisms. See also the paper by 
Drensky and Gupta [89]. 


Remark 10.5.4 Using the idea of the proof of Theorem 10.5.1 combined 
with structure theory of relatively free algebras, one can show that for any 
noncommutative Pl-algebra R, the relatively free algebra F2(R) has wild 
automorphisms, see the paper by Drensky [79]. There are also other methods 
for constructing wild automorphisms of relatively free algebras, see e.g. the 
survey article [81]. 


11. Free Lie Algebras and Their 
Automorphisms 


There are many books on the theory of free Lie algebras, as the books 
by Bourbaki [87], Bahturin [21], Reutenauer [228] and the recent book by 
Mikhalev and Zolotykh [184] on free Lie superalgebras (with a long list of 
references, including very recent results), etc. In this chapter we first survey 
some results on free Lie algebras which show that their combinatorics is very 
different from the combinatorics of free associative algebras. The proofs can 
be found e.g. in the books by Bahturin [21] or Mikhalev and Zolotykh [184]. 

Comparing the results on automorphisms of free algebras in the case of 
commutative, associative and Lie algebras, and on automorphisms of free 
groups, the picture is much better in the case of groups and Lie algebras. We 
show how the combinatorial results on free Lie algebras imply the tameness 
of their groups of automorphisms. We also state and prove the Jacobian 
conjecture for free Lie algebras. Finally, we deal with automorphisms of the 
free metabelian Lie algebra of rank 2 and automorphisms of relatively free 
nilpotent Lie algebras. 


11.1 Bases and Subalgebras of Free Lie Algebras 


We assume that K is any field. We fix an integer m > 1 and denote 
by A(V,) the free associative algebra freely generated by x1,...,%m (and 
Vin = span{x1,...,@m}). By Witt Theorem 1.3.5, the free Lie algebra Ly, is 
isomorphic to the Lie subalgebra generated by 11,...,%m in K(V,). Hence, 
every element of L,,, is a linear combination of associative monomials (words) 
IN £1,...,2%m. In the literature, there are different bases of L,, as a vector 
space, e.g. these of Hall [123], Lyndon [174] and Shirshov [239]. We give with- 
out proof the description of the basis of Lyndon-Shirshov in the version of 
Shirshov. 

We introduce the lexicographic ordering on the set of all nonempty words 
In 21,...,€m assuming that 


US Ui... Vi, A Uj, 1... Uj, =U 


if ¢ = j1,...,% = Je for some k > 0 and tg4i < Jxyi. Additionally, if p > ¢ 
and i] = ji,...,%¢ = jg (ie. v is a beginning of u), we also assume that u < v. 
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For a noncommutative word [u] with some distribution of the Lie brackets 
(e.g. [u] = [xo,[x1, ee]]), we denote by wu = [u] the associative carrier of 
[u], i.e. the associative word obtained by deleting the brackets (e.g. if [u] = 
[wo, (a1, eo]], then u = [u] = vou a). 


Definition 11.1.1 Let 


be the set of Lie commutators in A(V,,) constructed inductively in the fol- 
lowing way. 

1. Let [Uy] = {x1,...,am}. (We write «; instead of [a,].) 

2. If we have already constructed [U;,] for k = 1,...,n —1, then [U,] 
consists of all commutators [w] of degree n such that [w] = [[u], [v]], where 
[u] € [Ue], [ve [Un—x] and [uw], [v], [w] satisfy the conditions 
(i) [u] > [w] > [e]; et 
(i) If fu] = [lu], [uel], then [ua] = [e) 


Example 11.1.2 Let m = 2 and « ~ y. Then 


[Ui] = tx, yt, [2] = tly, 2]} 


and y > y® > 2; 


[Us] = tly, 2], ], Ly ty, el, 


Ym YYt ~ YE ~ YEE & &; 
[Va] = tly, ly, ly, el), fy, (lye), el), (ly, #], 2], 2}, 


Y > YYYE > YYE > YYLE > YE > YE > YELL > 2. 


Theorem 11.1.3 (Lyndon [174], Shirshov [239]) The set [U] constructed in 
Definition 11.1.1 is a basis of the free Lie algebra L,,. This basis is called the 
basis of Lyndon-Shirshov. 


For a polynomial g € L,, we denote by g’ the homogeneous component of 
highest degree (with respect to the usual degree defined as the length of the 
monomials in A(Vj,)). We say that the nonzero elements g1,..., 9% of Lm 
satisfy a nontrivial relation, if there exists a nonzero Lie polynomial 


F(zi,..-,2n) © Le C K(a,..., 2), 


such that f(g1,..-, 9%) = 9. 
Based on his combinatorics of words, Shirshov [237] proved the following 
result which is crucial in our considerations. 


Proposition 11.1.4 (Shirshov [237]) Let gi,...,g,% be elements of Lm, which 
satisfy some nontrivial relation. Then one of the homogeneous components 
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of highest degree g',...,9, belongs to the Lie algebra generated by the other 
components of highest degree. 


The proof of Proposition 11.1.4 is a consequence of the proof of [237, 
Theorem 2]. It may also be found in [184] in the more general setup of free 
Lie superalgebras. I believe that having the experience with the combinatorics 
of words in Chapter 9, the reader should be able to understand the proof e.g. 
in [184]. 


Example 11.1.5 Let 


n= (lz, y, x], [x,y] ~ [x,y], §2 = [@,¥, «] + 2[z, yl, 93 = [x,y], 


(these elements are not written as linear combinations of the basis elements 
of Lyndon-Shirshov). Then 


[1 + 93, [92, 93] = 0, 


and the homogeneous components of highest degree 
9, = (le, y, 0], [x ull, 95 = [ey #], 93 = 93 = [2,9 
satisfy 91 = (go, 9s]. 


Using his Proposition 11.1.4, Shirshov [237] discovered an algorithm find- 
ing a minimal system of generators for a subalgebra of L,, generated by a 
given system of polynomials and proved the theorem for the freedom of the 
subalgebras of free Lie algebras. 


Theorem 11.1.6 (Shirshov, [237]) Let L(X) be the free Lie algebra generated 
by a set X. Then every Lie subalgebra of L(X) is free. 


Theorem 11.1.6 is a Lie analogue of the Nielsen-Schreier theorem which 
states that every subgroup of the free group is free again. Two different proofs 
of the Nielsen-Schreier theorem are given for example in [176] and [135]. 


Exercise 11.1.7 Prove the following partial case of Theorem 11.1.6. Every 
finite set of elements {g1,..., 94} of the free Lie algebra L,, generates a free 
Lie subalgebra. 


Hint. Let G' be the subalgebra generated by g1,..., 9%. If the leading homo- 
geneous components gj,..., 9; are independent (1.e. satisfy no relation), then 
by Proposition 11.1.4 the elements g1,...,g, themselves generate a free Lie 
algebra. If g/,..., 9, are dependent, then one of them, say gj, belongs to the 
subalgebra generated by g,...,9;. Let f(z2,...,2%) be an element of the 
free Lie algebra L,_1 generated by z,...,2z,% such that gi = f(g5,.--,9%)- 
We define a grading on Lx_1 by degz; = degg}. Without loss of generality we 
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may assume that f(z2,..., 2%) is homogeneous of degree degg| = degg. The 
elements 


hy = 91 — f(ge,---5 9), hi = Gi, C= 2,...,k, 
generate the same subalgebra G as qj,...,gz. If hy = 0, then we apply 


induction on the number of the generators of G. If hy # 0, then the degree 
of h; is smaller that the degree of g; and we apply induction on yw degg;. 


11.2 Automorphisms of Free Lie Algebras 


Cohn [50] described the automorphisms of the free Lie algebra Ly, of any 
finite rank m. Now this result is considered as classical. As in the case of 
polynomial and free associative algebras, an automorphism of L,, is tame if 
it is a composition of linear and triangular automorphisms. 


Theorem 11.2.1 (Cohn [50]) For every integer m > 2 the automorphisms 
of the free Lie algebra L», are tame. 


Idea of the Proof. The original proof of Cohn [50] is based on the general- 
ization of the Euclidean algorithm to the noncommutative case and on the 
technique of free ideal rings. Another proof based on the Shirshov technique 
in free Lie algebras is given by Kukin [158]. Here we sketch his proof. It re- 
peats the main steps of the hint of Exercise 11.1.7. Let @ € AutL,, and let 
gi = O(a), i= 1,...,m. Clearly, {g1,..., gm} is a system of free generators 
of Lm. We apply induction on 57;"., degg;. If this sum is equal to m, then 
all g; are linear combinations of #1,...,@m. Since ¢ is an automorphism, we 
obtain that ¢ is an invertible linear transformation of V,,, and ¢ is a tame au- 
tomorphism. Let 577", degg; > m. Since g1,...,9m generate Ly, there exists 
Lie polynomials f;(21,..., 2m) without linear terms and constants a;; € K, 
t,j =—1,...,m, such that 


m 
ry = So aijgit fi(gis---.9m), J =1,-...m. 
t=1 


If all f;(21,..., 2m) are equal to 0, then the leading homogeneous components 
Gis -++5Gm are linearly dependent and one of them is a Lie polynomial (of first 
degree) of the others. If some f; is nonzero, then the leading homogeneous 
components g{,...,g/, are dependent because in the left hand side of the 
above equations there are monomials of first degree only. Again, gj,...,9/, 
are dependent and, by Proposition 11.1.4, one of them is a polynomial of the 
others. If gi = f(95,---,9),), then, as in Exercise 11.1.7, the elements 


hy = 91 — f(g2,---s 9m); hi = Gs, P= 2,...,mM, 


generate LD, and 
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S- degh; < S- degg;. 
t=1 t=1 


The mapping g; > A;, i = 1,...,m, induces a triangular automorphism 7 
of Ly, (with respect to the system of free generators {g1,...,9m} of Lm). 
Hence, the composition w = ro ¢ is an automorphism of L,, and 


w(es) = 7(O(a;)) = Tg) = hg, 7H 1,...,m. 


By induction, w is tame and this implies that @ is tame. With some addi- 
tional considerations, this proof gives also an algorithm how to decompose 








an automorphism into a product of linear and triangular automorphisms. 


Remark 11.2.2 Since the only Lie elements in one variable « are its scalar 
multiples az, a € K, for m= 2 the Cohn Theorem 11.2.1 gives that 


We continue our exposition with the Lie algebra analogue of Jacobian 
Conjecture 10.4.15. Such considerations were first carried out for free groups 
by Birman [35] who solved the Jacobian conjecture for free groups. The fur- 
ther development of the approach of Birman is very useful also for free and 
relatively free associative algebras. It helps to show that some classes of en- 
domorphisms are not automorphisms and to construct wild automorphisms 
of relatively free algebras. First we introduce partial derivatives. 


Definition 11.2.3 (i) Let f(a1,...,#m) be an element of the free associative 
algebra K(Vin) = K(01,...,%m). Write f in the form 


f= f(a1,...,&m) =at) wifiler,...,2m), ae Kf; € Kn). 
t=1 


We call the polynomials 


Of — Of 
Day = Dewy a iI tm) PS Ls ’ 
the (right) partial (For) derivatives of f. 

(ii) If @ is an endomorphism of K(V,,), the (right) Jacobian matrix of ¢ 
is the m x m matrix with entries from K (Vy) 


1(0) = Je(6) = (RR). 








Similarly we may introduce left partial Fox derivatives 0;/Q:a; and the left 
Jacobian matrix J;(¢) of an endomorphism ¢. 
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Exercise 11.2.4 Show that the Jacobian matrix in Definition 11.2.3 satisfies 
the chain rule 


Ir(G 00) = Ir(9)OUr(Y)), Hi(G 0) = OY) IG), 6, ¥ € End K (Vin), 


where ¢(.J(z)) means that ¢ acts on the entries of the matrix and * states for 
the transpose matrix. Derive from here that if @ is an automorphism, then 
its right Jacobian matrix has an inverse from the right. 


Hint. Let e.g. m= 2 and ¢= (f,g), & = (p,¢). Show that 
O,P(f, 9) _ Or f Opp Org On P Avlf, 9) _ Op OF Op Og 


ax Ov f | Oe eg Ax Afax |) Ag ex’ 














and similarly for the other derivatives. 


Definition 11.2.5 Since we assume that the free Lie algebra Li is a Lie 
subalgebra of the free associative algebra A(V,,), and every endomorphism 


@ of Lm is determined by the images of x1,...,%m, we obtain that @ induces 
an endomorphism ¢ of K(V;,). We define the Jacobian matrix J(@) of @ as 
J(d) = J(@). 


Exercise 11.2.6 (Reutenauer [227]) Let m = 2 and let ¢ € EndK (x, y) be 
defined by 

o(x) =x + yoy, oy) = y. 
Show that both the left and the right Jacobian matrices of ¢ are invertible 
but ¢ is not an automorphism of K(x, y). 


Hint. Use, for example, the commutator test of Theorem 10.3.9 to see that 
@ is not an automorphism: 


[o(x), é(y)] = [z, yl + yey” _ yay # aa, y), aE K. 
On the other hand, 


mo=(, {)a@= (1, 7). 


Exercise 11.2.6 shows that the Jacobian matrix J(¢) of Definition 11.2.3 
does not carry enough information about the endomorphism ¢ of K (Vin) 
even for m = 2. The “correct” definition of J(¢) for ¢ € EndK(V,,) is this 
of [59] and [234], see Remark 10.4.17. Nevertheless the following theorem of 
Reutenauer-Shpilrain-Umirbaev gives that for free Lie algebras the situation 
is completely another and our definition of the Jacobian matrix of ¢ € Endl, 
is sufficient for our study of the automorphisms of Z,,. In particular, the 
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theorem gives the affirmative answer to the Lie analogue of the Jacobian 
conjecture. 


Theorem 11.2.7 (Reutenauer [227], Shpilrain [241], Umirbaev [254]) Let ¢ 
be an endomorphism of the free Lie algebra Ly,. The right Jacobian matrix 
of @ is invertible from the right (as a matrix with entries from K (Vpn) ) if and 
only if @ is an automorphism. 


Proof. We follow the arguments of Umirbaev. If ¢ € AutL,, then, by the 
chain rule of Exercise 11.2.4, 


e=I,(¢067*) = J(9)O(J(G™)), 


where e is the unity m x m matrix. In this way we obtain the “easy” part 
of the theorem. Now, let J(¢) = J-(¢) be invertible over K(V;,) and let 
J,(¢)a = e for some m x m matrix a with entries from K(V,,). By Definition 





11.2.3 of Fox derivatives, and since f; = ¢(x;), j =1,...,m, are polynomials 
without constant terms, we obtain that 
Of; Of; 
;= ayo t+...4+2 =1,...,m 
f; 1 Any + + m Oxtm iJ ) } } 


and, in matrix form, 


(fi,.--,fm)a = (@1,...,@m)J(d)a = (41,..., 2m). 


Hence #1,..., £m belong to the right ideal generated by f,,..., fm in the uni- 
versal enveloping algebra U(Lm) = K(Vm). Obviously fi,..., fm belong to 
the right ideal generated by x1,..., @m, i.e. these two right ideals of U (Ly) co- 
incide. By Proposition 1.3.8, the Lie subalgebra of L,, generated by fi,..., fm 
coincides with the whole L,, and the endomorphism ¢ is an epimorphism. As 
in the case of polynomial and free associative algebras (see Exercise 10.1.11), 
this implies that ¢ is an automorphism. 


Comparing the final form of the results on automorphisms of free groups 
and free Lie algebras with the partial results for polynomial and free asso- 
ciative algebras it seems to us that the main difference between groups and 
Lie algebras on the one hand and polynomial and free associative algebras 
on the other lies in the good combinatorics in the group and Lie cases. For 
example, the subgroups of the free groups are free again (the Nielsen-Schreier 
theorem) and a similar statement holds for the subalgebras of the free Lie 
algebras (Theorem 11.1.6 of Shirshov). There exist algorithms which, for any 
set of generators of the free group and the free Lie algebra, give a minimal 
system of free generators and transform this system to the canonical system 
of free generators {x1,...,&m} (see [176] for groups and [237] and [158] for Lie 
algebras). Each step of the algorithms corresponds to a tame automorphism. 
Unfortunately, these arguments do not work in the associative case. 
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11.3 Automorphisms of Relatively Free Lie Algebras 


Very often the investigations on relatively free Lie algebras and, in partic- 
ular, on their automorphisms, are inspired by similar investigations both in 
group theory and theory of associative Pl-algebras. Maybe this is in the very 
nature of the relatively free Lie algebras. Some of them, as the free solvable 
or, more generally, the free polynilpotent Lie algebras, are direct analogues of 
the corresponding group objects. Sometimes the study of the group requires 
the study of the Lie algebra (as a “linearization” of the group problem). 
Sometimes the group theoretic result serves as a model or as a conjecture 
for the Lie algebra result. On the other hand, combinatorial methods devel- 
oped for associative algebras often work also for Lie algebras. The results of 
the present section are a combination of both associative algebra and group 
trends in the study of automorphisms of relatively free Lie algebras. 

As in the case of relatively free associative algebras (see the proof of 
Theorem 10.5.1), Remark 11.2.2 gives possibilities to construct wild auto- 
morphisms for relatively free Lie algebras of rank 2. 


Exercise 11.3.1 Let L2/L4 be the free metabelian Lie algebra generated by 
«and y. Let 04 w € 15/14. Show that the endomorphism exp(adw) defined 
by 

exp(adw)(u) = ut [u, wl, u € Lo/L4, 


is an automorphism. Such an automorphism exp(adw) is called inner. Show 
that the inner automorphisms act identically on L5/L4. 


Hint. Let ¢ = exp(adw). By Exercise 3.2.2, adw is a derivation of Lo/LY 
and, since w € LS/L4, we obtain that ad’w = 0. Hence adw is nilpotent and 
by Exercise 10.4.1, @ = exp(adw) is an automorphism. Clearly, if u € £5/L4, 
then [u, w] = 0 and ¢(u) = u. Pay attention that the result does not depend 
on the characteristic of the field K, because ad?w = 0, the only denominator 
in the expression of exp(adw) is 1! in adw/1! and the exponent is well defined. 
The exercise can be also handled directly, without using Exercises 3.2.2 
and 10.4.1. Verify that d([u, v]) = [d(u), d(v)], uv © Le/L4. For this pur- 
pose, using the anticommutativity, the Jacobi identity, and the metabelian 
identities [[u, w], [v, w]] = 0 and [[u, v], w] = 0 (since w € 24/15) show that 


[o(u), O()] = [w+ [u, w], 0+ [v, wl] = 
= [uo] + [u, [v, w]) + [fu vw], o] = [uo] = (Lu, ef), 


i.e. ¢ is an automorphism of L2/L5 which acts identically on L5/L4. 


The above Exercise 11.3.1 is a partial case of a result of Shmelkin [240] 
(based on techniques coming from group theory), that the automorphism of 
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the free solvable Lie algebra Lm / LS is inner if and only if it acts identically 
on LE-Y Le), 


Exercise 11.3.2 Show that every nonidentical inner automorphism of the 
free metabelian Lie algebra of rank 2 is wild. 


Hint. Use the same idea as in the proof of Bergman Theorem 10.5.1. Let 
I2/L5 be the two-dimensional abelian Lie algebra (i.e. a two-dimensional 
vector space with trivial multiplication and automorphism group isomorphic 
to Glo(K)). Show that the Lie algebra homomorphisms 


Lig —_ De/ Ly —_ De/L5 
induce group homomorphisms 


Use that every nonidentical automorphism of L2/L3 which induces the iden- 
tity map on L2/L4, is wild. 


There is a general theorem of Bahturin and Nabiyev [22], which states 
that for every m > 2, the inner automorphisms of the free metabelian Lie 
algebra Lm /L', are wild. 


Exercise 11.3.3 Let Fo(slo(K)) = Lo/(LeaNT(sl2(K)) be the relatively free 
algebra of the variety of Lie algebras generated by the Lie algebra slo(K) 
of traceless 2 x 2 matrices over a field of characteristic 0. Show that the 


endomorphism @ of Fo(slo(K)) defined by 


o(v) = 2+ [[z,y, y],[z, ull, oy) = y, 


is a wild automorphism. 


Hint. Show that [[[x, y, y], [z, yl], y] = 0 is a polynomial identity for slo(K) 
and, as in Theorem 10.5.1, the inverse of ¢ is vw, where 


U(v) =2 [ley yl, feu], YY) =v. 


Now we shall describe the automorphism group of the free metabelian 
Lie algebra of rank 2. The result follows from a theorem of Shmelkin [240] 
and is stated in this form in [79]. For further study of AutLo/LY involving 
also methods from representation theory of GL2(K), see the recent paper by 
Papistas [204]. 


Theorem 11.3.4 Every automorphism of the free metabelian Lie algebra 
Io/L of rank 2 is a composition of a linear and an inner automorphism. 
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Proof. Let @ be an automorphism of L2/L4 and let 


o(@) = aa + Byt+ fla,y), Oly) = ye + oy + gla, y), 


where a, 3,y,6 € K and f,g € 15/L%. Since ¢ induces an automorphism of 
the two-dimensional abelian Lie algebra L2/L5, we obtain that ¢ = wor, 
where 7 is the linear automorphism 


T(x) = ax + By, rly) = ya + dy 


and wy induces the identity map modulo the commutator ideal. Hence, without 
loss of generality we may assume that 


o(a)=a4+ f(x,y), oy) =ytg(a,y), fog € £5/L5, 


and we have to prove that # is inner. The commutator ideal £5/L has a 
basis 


{[x, yl(ad?x)(ad*y) | p,q > O}. 


As in Chapter 6 (see the proof of Theorem 6.2.1) we define an action on 
L5/L5 of the polynomial algebra K[é, u] in two commuting variables t and u 
by 

tu’ « [a y](ad?x)(ad4y) = [x, y](ad? x) (ad?**y). 


We write the images of x and y under ¢ and ¢7! as 
o(w)=x+ax [x,y], (y) =yt+bx [x,y], 


g'(e)=a+ex[z,y], &'(y) = y+ [e, y, 
where a,b,c,d € K[t, uj. Let for example a ¥ 0. Direct calculations show that 





« = (gog7')(x) = o(a+ex[x, yl) = etax[e, y]+cx[e+ar[a, y], y+be[2, yl] = 
=x+ax[x,y]+ex ([z,y] +a [[z, y],y] — 6 * [x,y], 2]) = 
=«e4+(a+c(1+ au — dt)) « [z, y]. 

Since the action of K[t, u] on 25/29 is faithful, we obtain that 





a+c(1+au— bt) =0. 
Similarly, we derive from x = (¢~1 o ¢)(x) 

e+a(1+cu-—dt)=0. 
The above two equations give that 


e=—a(l+cu—dt),a=-—c(1+ au — bt) = a(1+ au — bt)(1+ cu — dt). 





Since A[t, u] has no zero divisors and a 4 0, we obtain that 





(1+ au— bt)(1+cu—dt)=1 
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and this implies au = bt. Hence a = v(t, u)t, b = v(t, u)u for some v(t, u) € 
K[t, u] and this means that 


o(a) =a + v(t, u) * (C(x, yl) = « — [2,0 * [2, yl] = exp(ad(—v + [x, y))(2), 


o(y) =y + v(t, u) * (us [a, y]) = y — [y, v * [x, yl] = exp(ad(—v « [x, yl) (y), 


1.e. @ 1s an inner automorphism. 


It is interesting to compare the structure of the automorphism groups of 
the free metabelian group G»,/G", (where G', is the free group of rank m) 
and the free metabelian Lie algebra L,,/Li,,. The results of Chein [47], Bach- 
muth and Mochizuki [17, 18, 19] and Romankov [229] give that AutG,/G/, 
consists of tame automorphisms for all m 4 3 and AutG3/G3 is not finitely 
generated. Hence, by Theorem 10.1.2, Gs/G has a lot of wild automor- 
phisms. On the other hand, the only wild automorphisms of AutL,/L/, arise 
from the result of Bahturin and Nabiyev [22] for the wildness of the inner 
automorphisms of L,,/L',. The group analogues of the inner automorphisms 
are the automorphisms by conjugation and they are tame for any relatively 
free group. Freely restated, the following problem asks “how wild” are the 
automorphisms of Ly, /L",. 


Problem 11.3.5 Find generators of the automorphism group of the free 
metabelian Lie algebra L,,,/L/, for m > 2. Is AutL,/L, generated by the 
tame and the inner automorphisms? 


For m > 4 and chark = 0, the tame automorphisms form quite a large 
subgroup of AutL,/L,, see the paper by Bryant and Drensky [42]. (One 


needs some knowledge of the methods of the next Chapter 12 to read this 
paper.) 


Exercise 11.3.6 Prove the analogue of Theorem 11.3.4 for the free asso- 
ciative “metabelian” algebra K(x, y)/C*, where C is the commutator ideal 
of K{a,y): Every automorphism of K(x,y)/C? is a composition of a tame 
automorphism and an inner automorphism exp(adw), where w € C/C?. 


Hint. Since C/C? has a basis 
{ary? ([x, yjad°a ad“y) | a,b,c,d> 0}, 


define an action of the polynomial algebra in four variables on C/C? as in 
Chapter 6 and repeat the considerations in the proof of Theorem 11.3.4, 
replacing the linear automorphism with an automorphism of the polynomial 
algebra K (a, y)/C. If difficulties appear, see the paper of Drensky [79]. 


There is an analogue of the Jacobian matrix for free metabelian Lie and 
associative algebras of any rank m > 2. The Jacobian matrix is a matrix 
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with entries from the polynomial algebra in m variables for the Lie algebra 
case and in 2m variables in the associative algebra case. Results of Umirbaev 
[255, 256] show that in both cases the invertibility of the Jacobian matrix 
implies the invertibility of the endomorphism, i.e. the Jacobian conjecture 
has a positive solution for metabelian algebras. 

Finally, we show that the nilpotent relatively free algebras have a lot of 
wild automorphisms. For chark = 0 the result is a partial case of a result 
of Drensky and C.K. Gupta [88] based on applications of the representation 
theory of groups. A direct proof for the general case is in the paper of Bryant 
and Drensky [41] and is based on a method developed by Bryant, Gupta, 
Levin and Mochizuki [43] for constructing wild automorphisms of free nilpo- 
tent groups. 

The following exercise is a well known fact for nilpotent algebras. Properly 
restated, it holds also for nilpotent groups. 


Exercise 11.3.7 Let G be a finitely generated nilpotent (not necessarily 
Lie or associative) A-algebra. Show that every endomorphism ¢ of G which 
induces an invertible linear transformation on the vector space G/G?, is an 
automorphism. 


Hint. Show that ¢ induces invertible linear transformations ¢, on the factors 
G*/G***_ Tf g1,..., 9% © G, find elements hi,...,hy € G such that ¢(h;) = 
gi (mod G?); then ¢(hy...he) = gi..-ge (mod G*t!). Hence ¢% is onto 
G*/G*+!, Since G is finitely generated, the vector spaces G*/G**! are finite 
dimensional, and the “onto” linear transformations are invertible. Use that 
G” = 0 for some sufficiently large n, and derive that ¢ is a 1-1 correspondence. 





Theorem 11.3.8 Let T(G) be the T-ideal of a nilpotent but nonabelian 
Lie algebra G. Then for every m > 2, the relatively free algebra Fiy,(G) = 
Im/(T(G) A Ly) has wild automorphisms. 


Proof. Since G is not abelian, [21,22] # 0 in the relatively free algebra 
F,,(G). Additionally, G does not satisfy any polynomial identity of the form 
[v1,%2] + f = 0, where f € L? and FL is the free Lie algebra of countable 
rank. (This is obvious if the base field K is infinite because the T-ideal T(G) 
is generated by its homogeneous elements. Over a finite field A one needs 
some additional, but not very difficult inductive arguments on the class of 
nilpotency c of G: In order to make the inductive step, we use that 


[[v1, #2] + f,e3,...,@e] = [@1, €2,43,...,¢-] +[f,23,..., ae] = 9 


is a polynomial identity for G and derive that [#1,...,#-] belongs to T(G) + 
L°*+, Hence G is nilpotent of class c, which is a contradiction.) Since Fin, (G) 
is nilpotent, by Exercise 11.3.7 the endomorphism ¢ defined by 


(#1) = #1 + [21, v2], O(@;) = 21,7 = 2,...,m, 
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is an automorphism. We shall show that this automorphism is wild. Let the 
automorphism w of Ly, be a lifting of ¢. Then 
w(e1) =xXit [a1, Xa] + fi, w(x) = tj + fi, t= 2, ree ™, 


where f1,..., fm belong to L°. Hence, modulo the ideal of K(Vn) of the 
polynomials without constant and linear terms, the right Jacobian matrix 


J(w) of a is equal to 


l+a 0 0... 0 0 
—x, 1 0 0 0 
0 0 1 0 0 
0 0 0... 1 0 
0 0 0... 0 1 


Since # is an automorphism, the matrix J(7)) is invertible. Hence its image 
J(w) in the set of m x m matrices with commuting polynomial entries is also 
invertible and its determinant is a nonzero constant in the field. Concrete 
calculations show that det(J(w)) is equal to 








1+22+ terms of higher degree 


which shows that # cannot be an automorphism. 


For further reading on automorphisms of relatively free Lie algebras see 
the survey article by the author [81]. 


12. The Method of Representation Theory 


In this chapter we deal with the powerful method of representation theory of 
groups in the study of Pl-algebras. To the best of our knowledge, the origins 
of this method can be found in the papers by Malcev [180] and Specht [245] 
in 1950, where they applied for a first time representations of the symmetric 
group to the theory of Pl-algebras. Then, starting in the 70’s, the method was 
further developed, also in the language of representations of the symmetric 
group, in a series of papers by Regev (see e.g. his survey article [226]). Regev 
also involved other techniques, as asymptotic methods, combinatorics and 
even evaluations of multiple integrals, which nowadays are also considered as 
a part of the method of representation theory. In the early 80’s, the author 
[71] and Berele [29] started to use the representation theory of the general 
linear group which had appeared incidentally in some papers by Razmyslov, 
Procesi and others. A lot of work was done also by Formanek (see [105]). 
Recently, one has started to apply also representations of Lie superalgebras, 
instead of representations of the general linear group (see Berele and Regev 
(32]). 

The purpose of this chapter is to give some basic idea of the method 
with applications to the polynomial identities of the Grassmann algebra, 
the algebras of upper triangular matrices and the 2 x 2 matrices. We have 
also included some general facts about representation theory of groups and 
some results specific for the representations of the symmetric and the general 
linear groups. For more detailed background see for example the books by 
Lang [161], Kostrikin [153] or Curtis and Reiner [52]. A good source for 
knowledge on representation theory of symmetric and general linear groups 
are the books by Weyl [266], James and Kerber [129] or Macdonald [175]. 
For further reading on applications to Pl-algebras we refer e.g. to the survey 
articles by Regev [226] and the author [77, 78]. It turns out that the technique 
under consideration is very useful also in the study of automorphisms of 
relatively free algebras. For exposition see the survey article [79] and the 
references there. 

Although most of the considerations hold over any field K of character- 
istic 0, in this chapter we assume that K is an algebraically closed field of 
characteristic 0, e.g. K = C. Sometimes we shall emphasize on the algebraic 
closeness of K if it is really essential for the considerations. 
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12.1 Representations of Finite Groups 


We fix the notation G for a group. If G is finite, we denote by |G the number 
of its elements. If V is a vector space, we denote by GL(V) the group of 
all invertible linear operators of V. If dimV = m < o, fixing a basis of V, 
we identify the group GL(V) with the group GLm = GL ,»(K) of invertible 
m x m matrices with entries from K (see Exercise 1.1.4 (iii)). 


Definition 12.1.1 Let G = {g | g € G} be a group. The A-algebra KG 
with basis as a vector space {g | g € G} and with multiplication between the 
basis vectors defined by the multiplication in G is called the group algebra of 
G (see also Exercise 1.1.4 (vii)). 


Example 12.1.2 (i) Let G = {1l,a,b,c} be the elementary abelian 2-group, 
ie. the abelian group generated by a,6 with relations a? = 6? = 1 (and 
c = ab). The group algebra KG is the four-dimensional vector space with 
basis {1, a, b,c} and multiplication 


(e1 tayat Bb + y1c)(e2 + aga + Bob + y2c) = 





= (€1€2 + aa2 + 182 + y172) + (E102 + a1é2 + Piy2 + V1 82)a4 











+(€1 82+ Beg + ayye + yraa)bt (e172 + yee + a1 Go + Biaa)e. 





(ii) If G = (x) is the infinite cyclic group generated by x, then the group 
algebra KG has a basis {1,e*!,2+?,...} and multiplication «* a! = aft! 
k,l € Z. The algebra KG is isomorphic to the algebra of Laurent polynomials 
in one variable. 














Definition 12.1.3 Let G be a group and let V be a vector space. 
(i) A representation ¢ of G in V is a group homomorphism 


6:G—+GL(V). 


The degree of the representation @ is equal to the dimension of the vector 
space V. The representation @ is faithful if the kernel of ¢ is trivial; ¢ is 
trivial, if its kernel coincides with G. 

(ii) Two representations @¢ : G —> GL(V) and ¢’ : G —> GL(V’) are 
called equivalent or isomorphic, if there exists an isomorphism @ : V —> V’ 
of the vector spaces V and V‘ such that 


(9.0 6(9))(v) = (¢'(g) 0A) (ve), VEV G EG. 


(iii) If W is a subspace of V such that (¢(G))(W) = W, then the repre- 
sentation ~ :G— GL(W) defined by 


(Y(g))(w) = (PW) (e), 9G, wEW CY, 
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is called a subrepresentation of the representation ¢ : G —> GL(V). The 
subrepresentation ¢ is properifW # {0} and WV. 

(iv) If b’ : G—> GL(V") and ¢” : G — GL(V") are two representations 
of G, then the representation ¢ = ¢' @ 6": G—> GL(V’ @ VY”) defined by 


(29) (0%, 0”) = (H'(D)(%), ("MO"), GEG, (Uw EV EV", 


is called the direct sum of ¢' and @”. Similarly one defines the direct sum 
of any (finite or infinite) number of representations. The tensor product ¢ = 


¢ © 6":G—>GL(V' @V") of the representations ¢’ and ¢” is defined by 
(9) (rv) = (H'(M)() @ CMO"), GEG Var EeVav" 


(v) The representation ¢: G —> GL(V) is irreducible if it has no proper 
subrepresentations; ¢ 1s completely reducible if it is a direct sum of irreducible 
representations. 


Remark 12.1.4 Since every representation ¢: G —> GL(V) of a group G 
in V defines a linear action of G on V, we may consider V as a left KG- 
module. Sometimes we say that V is a G-module instead a KG-module. The 
main notions as subrepresentations, irreducibility, etc. for ¢ correspond to the 
similar notions for the G-module V. In particular, the regular representation 
p of G is the representation of G in KG defined by 


p(g): S- anh > S- angh,g EG, a, € K. 
heG hEeG 


Considering KG as a left G-module, we always assume that G acts on KG 
in this way, as a group of left translations. The subrepresentations of p cor- 
respond to left ideals of the group algebra AG and the irreducible subrepre- 
sentations correspond to the minimal left ideals of KG. 


Exercise 12.1.5 Let ¢: G—> GL(V) be a representation of G in the m- 
dimensional vector space V. Show that ¢ is decomposable (i.e. has a proper 
subrepresentation) if and only if there exists a basis {v1,..., Ug, Uk41,---; Um} 
of V such that the matrices of 4(g), g € G, with respect to this basis, have 


the block form 
_ {4 0 
og) = (‘ :) 


where ag, by and cg are respectively k x k, (m—k) xk and (m—k) x (m—k) 
matrices. Prove that @ is reducible, i.e. a direct sum of proper subrepresen- 
tations if and only if the basis of V can be chosen in such a way that the 
matrices bg have only zero entries for all g € G. 


Maschke Theorem 12.1.6 Every finite dimensional representation of a 
finite group G is completely reducible. If the base field K is algebraically 
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closed, then the group algebra KG is isomorphic to a direct sum of matrix 
algebras, 


KG & Ma,(K)®...® Ma,(K). 


Exercise 12.1.7 Let G be a finite group. Show that every irreducible G- 
module is isomorphic to a minimal left ideal of the group algebra KG. 


Hint. If V is an irreducible G-module and v is a fixed nonzero element of V, 
then the mapping ¢: AG — V, defined by 


6: S > agg > S > age, ag EK, 


geG geG 


is a homomorphism of G-modules (where the G-module AG corresponds to 
the regular representation of G). Since the kernel of ¢ is a submodule of 
KG, i.e. a left ideal of KG, by Maschke Theorem 12.1.6, there exists a G- 
submodule W of KG such that KG = Keré@W and V = W as G-modules. 
Since V is irreducible, we obtain that W is a minimal left ideal of KG. 


Exercise 12.1.8 Let G be a finite group and let 
KGS Ma (K)@...8 Ma (4K). 


Show that, up to equivalence, the irreducible representations ¢ : G —> 
GL(V) of G are the following. The G-module V is a d;-dimensional vec- 
tor space with the canonical action of the matrix algebra Mg,(K) C KG 
and the other matrix algebras My,(A’), j 4 4, act trivially on V (sending 
the elements of V to 0). Derive from here that every finite group has a finite 
number of nonisomorphic irreducible representations. 


Hint. Use Exercise 12.1.7 and the fact that the minimal left ideals of the 
matrix algebra Mg,(K) are d;-dimensional vector spaces which are isomor- 
phic as Mq,(K)-modules. If difficulties appear, see for example, the book by 
Herstein [126]. 


Definition 12.1.9 Let V,,...,V, be all pairwise nonisomorphic modules of 
the finite group G. Let W be a finite dimensional G-module. Let W = Wi 
... BW, be a decomposition of W into a sum of irreducible G-submodules. If 
m,; of the modules W,,...,W, are isomorphic to V;, we call the nonnegative 
integer m; the multiplicity of V; in the decomposition of W and write 


W=mV,6...8m,V,. 


The following exercise is a restatement of the famous Schur Lemma (see 
e.g. Herstein [126] or Lang [161]). 
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Exercise 12.1.10 (i) Show that the centre of the matrix algebra Ma(K) 
consists of all scalar matrices. 

(ii) Let G be a finite group, let ¢ : V —> W be a homomorphism of 
irreducible G-modules and let 0 4 v € Y. Show that either ¢ = 0 or ¢ is an 
isomorphism and there exist a unique up to a multiplicative constant element 
w € W (which depends on v only) and a nonzero constant @ € K (which 
depends on ¢) such that 


@ So aggv = BS> aggu, age K. 


geG geG 


Hint. (i) Ifa = ijn a4;ei;, Gi; © K, isin the centre of Mg(), commuting 
a with matrix units e,,, show that aj; =0 ift #7 and ay = a41. 

(ii) Use that the kernel and the image of ¢ are submodules of V and W, 
respectively, and V and W are irreducible modules. Hence, either ¢ = 0 and 
Ker@ = V or Kerd = 0, Imé = W and V = W. In the latter case, use 
Exercise 12.1.8. Assume that the vector spaces V and W coincide and {v = 
v,v2,..., va} is a basis of V. Consider ¢ as an invertible linear operator of V. 
The condition that ¢ is a G-module isomorphism implies that ¢ commutes 
with all linear operators of V, and ¢ is a scalar multiplication by (i). 


Exercise 12.1.11 (i) Let J be a minimal left ideal of the matrix algebra 
Ma(K). Show that there exists a nonzero element 7 € J such that i? = i. This 
element is called a minimal idempotent of J. Find the minimal idempotent 
when / consists of all matrices with nonzero entries in the first column only. 

(ii) Show that for every minimal left ideal J of the group algebra KG 
of the finite group G' there exists a minimal idempotent e € J such that 
Il=KG-e. 


Hint. (i) Clearly J = I. Ma(K) is a two-sided ideal of Ma(K). Since the 
matrix algebra is simple, i.e. has only trivial two-sided ideals, J = Ma(K). If 
I? =0, then 


J? = (I(Ma(K)-1))Ma(K) C I?» Ma(K) = 0, 


which is impossible. Hence 0 # J? C JI. Since I? is a left ideal, from the 
minimality of J we obtain that J? = /.If0 4a € J, Ia 40, then there exists 
an i € J such that a = ia (why?) and, therefore ia = ia, (i? — i)a = 0. The 
set Ip = {7 € 1 | ja = 0} is a left ideal of Mg(K) and either Jp = 0 or Jy = I. 
Since 7 ¢ Io, we obtain that Jo = 0 and 7? —i=0. 

(ii) By Exercise 12.1.8, every minimal left ideal of AG is a minimal left 
ideal of a matrix subalgebra of KG and we apply (i). Since the minimal left 
ideal is generated by any of its nonzero elements, we obtain that J = KG -e 
for some minimal idempotent e. 
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Definition 12.1.12 Let 6: G—> GL(V) be a finite dimensional representa- 
tion of the group G. The function yg :G —+ Kk defined by 


xo(g) = trv ($(9)), 9 EG, 


is called the character of ¢. If @ is an irreducible representation, then yg is 
called an irreducible character. 


Exercise 12.1.13 If g; and gz are two conjugate elements of the group G 
and ¢ is a finite dimensional representation of G’, show that y(g1) = xo(g2), 
i.e. the characters are constant on the conjugacy classes of the group. 


Hint. Use that the conjugate matrices have the same characteristic polyno- 
mials and hence the same traces. 


Exercise 12.1.14 Let ¢ and wv be two finite dimensional representations of 
the group G. Show that 


Xoay = Xo + Xv. Xo@v = XG *Xy- 


Hint. Let V and W be the G-modules corresponding to the representations @ 
and w, respectively. Choose bases of V and W and construct bases of V 6 W 
and V @ W. Express the traces of (¢ @ w)(g) and (¢@ © #)(g) by the traces of 


é(g) and #(g) for g EG. 


The following theorem shows that the knowledge of the character gives a 
lot of information for the representation and the number of the irreducible 
representations (which is a ring theoretic property) is determined by a purely 
group property of the group. Usually, in the text books the second part of 
the theorem is derived as a consequence of the first part. 


Theorem 12.1.15 Let G be a finite group and let the field K be algebraically 
closed. 

(i) Every finite dimensional representation of G is determined up to iso- 
morphism by its character. 

(ii) The number of the nonisomorphic irreducible representations of G is 
equal to the number of conjugacy classes of G. 


Remark 12.1.16 For groups G of small order one gives the table of irre- 
ducible characters of G. The rows of the table are labeled by the irreducible 
characters and the columns by representatives of the conjugacy classes. The 
entries of the table are equal to the values of the characters on the corre- 
sponding conjugacy classes. 
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Exercise 12.1.17 (i) Let G = (g | g” = 1) be the cyclic group of order n 
and let ¢ be a primitive n-th root of 1. Show that the irreducible characters 
xj, 9 =90,1,...,n—1, of G are defined by 


xj(g*) =e", k=0,1,...,.n-1. 


(ii) If G is a direct product of the cyclic groups (g | g” = 1) and (A | 
= 1), show that all irreducible characters of G are 


hn 
Xpq(g* h') = E*P 4, p=0,1,...,m—1,¢=0,1,...,n—1, 


where € and 7 are primitive m-th and n-th roots of 1, respectively. 


Hint. (i) Show that the mapping ¢; : g* + e/*, k = 0,1,...,n—l, isa 
one-dimensional representation of G. By Theorem 12.1.15, the group G has 
exactly n irreducible representations with pairwise different characters. Use 
similar arguments for (ii). 


Exercise 12.1.18 Let D be the dihedral group of order 8, i.e. D is the group 
of symmetries of the square (all congruencies of the plane which fix a given 
square); D is generated by a rotation r and a reflection s with defining rela- 
tions 


ras? =1,srs=r'. 


(i) Show that D = {1,r,r?,r°, s, rs, r7s,r°s} and the conjugacy classes of 
D are 
{1}, {7}, fr, rh, {5,175}, firs, rs}. 


(ii) Show that the table of irreducible characters of D coincides with the 
table given in Table 12.1. 


Table 12.1. The character table of the dihedral group of order 8 
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Hint. (ii) Every representation of the factor group 


D=D/(r?) = |r? =1) x (8 | 8° = 1) 


can be considered as a representation of D. In this way, Exercise 12.1.17 
gives the first four irreducible characters. The group D has an irreducible 
two-dimensional representation @ as the group of symmetries of the square. 
Fixing a basis of the two-dimensional vector space, we may assume that 


om= (7 B)em= (V1) 


and concrete calculations give the fifth irreducible character of D. 





Exercise 12.1.19 Let H be the quaternion group, i.e. H = {+1,+i,+j, +k} 
and the multiplication of H is given by 




















Papak=-1,ij=—ji=k, jk = —kj =i, ki=—ik =j. 





(i) Show that the conjugacy classes of H are 


t,t, tt, tea}, tee}. 


(ii) Show that the table of irreducible characters of H is the given in Table 
12.2. 

















Table 12.2. The character table of the quaternion group 





Hint. (ii) The factor group 
H=H{(-1)2G |? =1)x G/F =1) 


gives the first four irreducible characters. The group H has an irreducible 
two-dimensional representation ¢ defined (for AK = C) by 
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. i 0 . 0 1 
o(2) = (; °.).e)= (2 s) 
(where i? = —1 in C) which gives the fifth irreducible character of H. 


Comparing the character tables of the dihedral group of order 8 and the 
quaternion group given in Exercises 12.1.18 and 12.1.19, we see that D and 
HT have the same tables of irreducible characters, although D and H are 
not isomorphic. Compare this with Theorem 12.1.15, where for a fixed finite 
group the character determines the representation. 


Exercise 12.1.20 Let the field K be algebraically closed and let p be the 
regular representation of the finite group G. 
(i) Show that y,(1) = |G] and y,(g) = 0, ifg #1. 
(ii) If y1,..., xr are all irreducible characters of G and d; is the degree of 
Xi, prove that 
Xp = dixit...+drxr. 


Hint. (i) Calculate the trace of the matrix p(g) with respect to the basis 
{g |g € G} of KG. (ii) Apply Maschke Theorem 12.1.6 and Exercise 12.1.8. 
Use that the matrix algebra Mq(K) is a direct sum of d minimal left ideals, 
which are d-dimensional. 


12.2 The Symmetric Group 


In this section we summarize the necessary background on the representation 
theory of the symmetric group S;,. 


Exercise 12.2.1 Let o be a permutation of the symmetric group S, pre- 
sented as a product of independent cycles 


o> (i... .tp) (jr... jg)... (hr. hr). 


Show that the conjugacy class 7°” consists of all permutations with the same 
cyclic decomposition as o. 


Hint. Show that ¢ = rpr—', where 
p=(l...p)(p+1...p+q)...(n—r+1...n), 
=(} .. p ptl ... pt ... no-rtl ... "| 
T=. . . . . 
twee by qt Lee Jaq Lee ky ee fe 


Definition 12.2.2 A partition of n (in not more than & parts) is a k-tuple 
of nonnegative integers 4 = (Ai,..., Ag) in decreasing order (i.e. Ay >... > 
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Ay > 0) and such that Ay +...+ A, = n. We use the notation A - n or 
A € Part(n). We identify two partitions if they differ by a string of zeros 
in the end. For example, A = (4,2, 1) = (4,2,1,0) = (4,2,1,0,0) F 7. It is 
also convenient to indicate the number of times each integer occurs in the 
partition and to write for example (3, 27, 1*) instead of (3,2,2,1,1,1,1). 


Definition 12.2.3 The Young diagram [A] of a partition A = (Aq,...,Ax) Fn 
can be formally defined as the set of knots or points (7,7) € Z? such that 
1L<j <XA;,2% = 1,...,&. Graphically we draw the diagrams replacing the 
knots by square boxes, adopting the convention, as with matrices, that the 
first coordinate ¢ (the row index) increases as one goes downwards, and the 
second coordinate j (the column index) increases as one goes from left to 
right. The first boxes from the left of each row are one above another and the 
i-th row contains \; boxes. We denote by ri the length of its j-th column of 
[A]. The partition \’ = (Aj,...,A;) and its diagram [A‘] are called conjugate 
respectively to \ and [A]. For example, for 4 = (4, 2,1), the corresponding 
Young diagram is given in Fig. 12.1 and (4,2, 1)’ = 3, 2,17). 


Fig. 12.1. The Young diagram [A] = [4, 2,1] 


Definition 12.2.4 The (¢, j)-hook of the diagram [A] = [A1,..., Ax] consists 
of the j-th box of the i-th row of [A] along with the A; — 7 boxes to the right 
of it (called the arm of the hook) and the Xi — 7 boxes below of it (the leg of 
the hook). The length of the hook is equal to A; + A; —i—j +1. For example, 
the (2, 3)-hook of [4?,3, 1] is of length 4, see Fig. 12.2. 





Fig. 12.2. The boxes of the (2,3)-hook of [4°,3, 1] are marked by X 


Definition 12.2.5 For a partition A = (Ay,...,A”) F n, we define a »- 
tableau T, of content a = (a1,...,a@m), where ay +... + am = n, as the 
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Young diagram [A] whose boxes are filled in with a; numbers 1, a2 numbers 2, 

.,Qm numbers m. The tableau is semzstandard if its entries do not decrease 
from left to right in the rows and increase from top to bottom in the columns. 
The tableau 7) is standard if it is semistandard and of content (1,..., 1), ie. 
every integer 1,...,7 occurs in it exactly ones. For example, in Fig. 12.3, the 
two tableaux from the left are, respectively, semistandard of content (2,3, 1, 1) 
and standard and the third is not semistandard. 





Fig. 12.3. The left and middle tableaux are semistandard, and the right is not 


Definition 12.2.6 For a partition of n, the symmetric group S, acts on 
the set of A-tableaux of content (1,..., 1) in the following way. If the (¢, j)-th 
box of the tableau 7, contains the integer k, then the (¢, j)-th box of oT) 
contains o(k), o € S,. The row stabilizer R(T) of Ty is the subgroup of S, 
which fixes the sets of entries of each row of 7). Similarly one defines the 
column stabilizer C(T)) of the tableau. For example, in Fig. 12.4 we give the 
(3, 2)-tableau T and its image under the action of ¢ = (12)(345). The row 
stabilizer of T is the direct product of S3 acting on 1,4,2 and S» acting on 
5,3. The column stabilizer is C(T) = ((15)) x ((48)). 


: 
HEIN BE 
Fig. 12.4. The right tableau is the image of the left under o = (12)(345) 


The following theorem describes the irreducible representations of the 
symmetric group. 


Theorem 12.2.7 Let K be any field of characteristic 0. For a partition 
of n and a A-tableau Ty of content (1,...,1), let R(T,) and C(T)) be, re- 
spectively, the row and column stabilizers of T,. Consider the element of the 
group algebra KS; 


Yo SS Gigny) pr 


pER(Ty) YEC(Ta) 
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(i) Up to a multiplicative constant e(T) is equal to a minimal idempotent 
of the group algebra KS, and generates a minimal left ideal of KS, (i.e. an 
irreducible S,,-module). 

(ii) If fs is another partition of n and T, is a p-tableau, then the S,- 
modules KS, -e(T,) and KS, -e(T,) are isomorphic if and only if A= p. 

(iii) Every irreducible S,-module is isomorphic to KS, -e(T,) for some 
partition \F n. 

We denote respectively by M(A) and x) the irreducible S,-module related 
with the partition 4 n and tts character. 


Remark 12.2.8 For \} n, let 7, and 7 be two A-tableaux. An isomorphism 
between the S,,-modules KS, -e(T,) and KS, -e(T2) is defined by x > xo7!," 
« € KS), -e(T\), where o € S,, is such that Tz = oT). By the Schur lemma 
(Exercise 12.1.10 (ii)), all isomorphisms are ¢. : KS, -e(T1) 3 KS), - e(T), 
where ¢a(a) =axvo-t and0 Fae k. 


Remark 12.2.9 By Exercise 12.2.1, the conjugacy classes of S$, are deter- 
mined by the cyclic decomposition of the permutations. We can relate to the 
permutation 


p=(l...p)\ptl...ptaq)...m—r4l...n)p>q>...>7, 


the partition A = (p,q,...,7) F n. In this way we define a 1-1 correspondence 
between the conjugacy classes of S,, and the partitions of n, which agrees with 
Theorem 12.1.15 (ii). One of the differences between representation theory 
of the symmetric group and of some other groups is that the number of 
nonequivalent irreducible representations of S,, does not depend on the base 
field K, i.e. we do not need the assumption that K is sufficiently large (e.g. 
algebraically closed). 


Exercise 12.2.10 Let n = 3. Find bases (as vector spaces) of the S3-modules 
KS3-e(T,) and KS3 -e(T2), where the (2, 1)-tableaux 7, and Ty are respec- 
tively the left and the right tableaux in Fig. 12.5. Find an S3-module isomor- 
phism between A.S3 - e(T,) and K.S3 - e(T2). Find a representative of each 
S3-submodule of AS3 isomorphic to M (2, 1). 


Fig. 12.5. The (2, 1)-tableaux 7, and T> in Exercise 12.2.10 


Solution. The row and column stabilizers of T, are, respectively, R(T) = 


((12)) and C(7,) = ((13)). Hence 
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vy = e(T,) = (1+ (12))(1 — (13)) = 1 + (12) — (13) — (132). 


Acting on v, by the elements of 53 we see that every element 7-e(7T1), 7 € Ss, 
is a linear combination of v; and 


vy = (123) - e(T,) = (123) + (13) — (23) -1. 
Hence M(2, 1) is two-dimensional. Similarly we obtain a basis of K$3 -e(T2) 
wy = e(T2) = (1+ (13))(1 — (12)) = 1 + (13) — (12) — (123), 
wy = (123) - e(T2) = (123) + (23) — (13) — (132). 


Since Ty = (23)7,, Remark 12.2.8 shows that an isomorphism between K'S3 - 
e(T) and KS - e(T2) is given by 


vy > v1(23)7! = v1 (23) = (23) + (123) — (132) — (13) = we, 








Hence all isomorphisms ¢q : A.S3 -e(T,) —>+ KS3 -e(T2) are defined by 


boa(v1) = awe, de(v2) = a(—wi —w2), OF AER. 


Let @: M’ —+ M” be an isomorphism between the irreducible 5'3-modules 
M’ and M”. Using the Schur lemma (Exercise 12.1.10 (ii)), it is easy to see 
that every irreducible S3-submodule of M’ 6 M” is 


Men = {(E@,99(x)) |@ € M’}, (80) F (0,0), 


and Me, = Me, if and only if (€,7) and (€’,7') are proportional. Since 
dimM (2,1) = 2, Exercise 12.1.20 gives that AS3 contains a direct sum of 
exactly two isomorphic copies of (2,1). Hence, the elements v1 + awe, 
a € K, and we can be chosen as representatives of the irreducible submodules 


M(2,1) of KS3. 


Exercise 12.2.11 Fill in the table of the irreducible characters of 5, for 
n<A4, 


Hint. Forn = 1 and n = 2 apply Exercise 12.1.17. For n = 3 use ideas similar 
to those of Exercise 12.1.18, bearing in mind that the symmetric group $3 
is isomorphic to the dihedral group of order 6 (the group of symmetries of 
the regular triangle) or working in the following way. For any n > 1 the 
group S, has two one-dimensional representations, the trivial ¢;(¢) = 1 
and the sign representation ¢2(o0) = signa, o € S,. Using Theorem 12.2.7, 
one can see that the trivial representation corresponds to the partition (n) 
and the sign representation to the partition (1”). A third representation is 
obtained from the action of the symmetric group on the vector space V, with 
basis {@1,.-.,2n} by o(xi) = oi), @ € Sy. It is easy to see that V, is a 
direct sum of two irreducible $,,-modules, one of them the trivial, spanned on 
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xy+...+2%,, and the other n — 1-dimensional. Calculate the character of V, 
modulo the trivial representation. For n = 4 use that the factor group of $4 
modulo the Kleinian subgroup {(i7)(Al) € S4} is isomorphic to S3 and the 
three irreducible representations of S3 are also irreducible representations 
of S4. The three-dimensional submodule of V4 gives the fourth irreducible 
representation. Finally, we complete the character table by the tensor product 
of the sign representation and the three-dimensional representation which we 
just found. The character tables of S3 and S4 labeled, respectively, by the 
partitions of 3 and 4, are given in Tables 12.3 and 12.4. 


Table 12.3. The character table of S3 





(3) 1 1 1 
(2,1) | 2 0 -l 
(1°) 1-1 1 





(4) | 1 1 1 1 1 
(3,1) | 3 1 0 -1 -1 
| (27) | 2 0 -1 0 2 | 
| (2, 17) | 3 -1 0 1 -1 | 
(1*) 1 -1 1 -1 1 


The degrees of the irreducible representations of 5S, can be obtained in 
two ways. 


Theorem 12.2.12 Let A be a partition of n. 
(i) The dimension dy = dimM (A) of the irreducible S,,-module M(A) is 


equal to the number of standard \-tableaur. 
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(ii) (The Hook Formula) 


n! 


dy = =. 
* TIO: +, 4-7 4D 


where the product runs on all bores (i,j) € [A], i.e. the denominator is equal 
to the product of the lengths of all hooks of the diagram [A]. 


Exercise 12.2.13 Calculate the degrees of the irreducible representations of 


Ss. 


Hint. (i) Applying Theorem 12.2.12 (i), we obtain, for example, that there 
are five standard (3, 2)-tableaux given in Fig. 12.6 and dimM (3, 2) = 5. 


Fig. 12.6. The standard (3, 2)-tableaux 


(ii) Applying the hook formula of Theorem 12.2.12 (ii), we obtain the 
lengths of the hooks of the diagram [8, 2] (written in the boxes of the diagram 
in Fig. 12.7). 


Fig. 12.7. The lengths of the hooks of the diagram [3, 2] 


Hence 


1.2.3.4.5 
dimM(3,2)= 73794 = 


Similarly we obtain for the other irreducible S;-modules 
dimM (5) = 1, dimM(4, 1) = 4, dimM(3, 1°) = 6, 
dimM (27, 1) = 5, dimM(2, 1°) = 4, dimM (1°) = 1. 


Remark 12.2.14 In Table 12.5 we give the character table of the symmetric 
group Ss which we need in some of the exercises below. 
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Table 12.5. The character table of Ss 


1 (12) (123) (1234) (12)(34) (123)(45) (12345) 





(5) | 1 1 1 1 1 1 1 
(4,1) | 4 2 1 0 0 -1 -1 
(3,2) | 5 1 -1 -1 1 1 0 

| (3, 17) | 6 0 0 0 -2 0 1 | 
| (27,1) | 5 -1 -1 1 1 -1 0 | 
| (2, 1°) | 4  -2 1 0 0 1 -1 | 

(1°) 1 -1 1 -1 1 -1 1 


12.3 Multilinear Polynomial Identities 


This section is devoted to some applications of representation theory of the 
symmetric groups to Pl-algebras. The main idea is to use the action of the 
symmetric group 5;,, on the vector space of multilinear polynomials of degree 
n and to translate and solve the considered problems on PI-algebras in the 
language of representation theory of S,,. This action of S,, is given in the next 
exercise. 


Exercise 12.3.1 Let P, be the set of all multilinear polynomials of degree 
n in the free associative algebra K(X). 

(i) Show that the following action of the symmetric group S, makes P,, 
a left S,-module isomorphic to the group algebra A'S, considered as a left 
S,-module (related to the regular representation of S,,): 


o(S- A484, ...€;,) = S- aita(i,) + 2o(in)s TE Sn, a EK, Ui, ... i, © Ph. 
(ii) If U is a T-ideal of K(X), prove that UMP, is a submodule of P,. 


Hint. (i) The vector space P, has a basis 
{®@x(1) ...2ax(n) | TE Sn } 


and, identifying 7 with &7(1)...%,(n) we obtain an isomorphism of A'S, and 
P,, as vector spaces. Show that this is also an S,,-module isomorphism. (ii) Use 
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that the T-ideal U is invariant under all substitutions and for f(a1,...,%n) € 
UN Pr» 


o(f(at1,---,®n)) = F(@o(1),-- +s ®o(ny) EUN Pa, 7 E Sn. 


In order to apply successfully representation theory of S,, to concrete prob- 
lems we have to know the module structure of some important submodules 
of the S;,-module P,,. The next exercises contain some typical calculations. 
(In the next sections we shall see how some of the considerations can be 
simplified using representation theory of the general linear group.) 


Exercise 12.3.2 Let L(X) be the free Lie algebra considered as a Lie sub- 
algebra of the free associative algebra K(X) and let PL, = P,M L(X) be 
the set of multilinear Lie polynomials of degree n. Determine the S,,-module 
structure of PL, for n < 3. 


Solution. For n = 1 the vector space P£, is one-dimensional and S; has the 
trivial irreducible representation only. Hence PL, = M(1). 
For n = 2, again dimPL, = 1 because PL is spanned on [#1, 22]. Since 


(12)[21, v2] = —[x1, v9], 


we obtain that this is the sign representation of S2.. Hence Pho = M(1*). 
Now, let n = 3. We shall consider several possibilities to handle the prob- 
lem. 


Method 1. We shall show directly, that PZ is an irreducible S3-module. Let 
M be an irreducible submodule of PL3 and let 0 4 f € M. Every element of 
PIs has the form 


f = f(e1, 22, £3) = alee, £1, ¢3] + Blas, £1, £2], a, 8 € K. 
Since [a3, 2, 1] = —[ae, 71, 3] + [x3, 21, rol, 
(12)f = —(a + B)[e2, 1, £3] + Bla3, €1, €2] €E M 


and if 8 #0 and 2a+ 2 40, we obtain that both [a2, a1, a3] and [as, #1, #9] 
are in M. Hence M = PL3. If @ = 0, then f = ale, #1, 23] and (23)f = 
alas, 1,2] © M,ie. again M = PLs. Finally, if 2a+ 6 = 0, we may assume 
that a= 1, 6 = —2 and 


f = [v2, 21,23] — 2[@3, 21,29] € M, (23) f = —2[r2, 21, #3] + [z3, 21, e2] © M. 


Again, f and (23)f are linearly independent and M = PL. Having a look in 
the character table of S3 (see Exercise 12.2.11 and Table 12.3) we make the 
observation that the only irreducible two-dimensional S3-module is M (2, 1). 
Hence PL3 = M(2, 1). 
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Method 2. We shall calculate the character y of PL3. Since dimPL3 = 2, we 
obtain y(1) = dimPL3 = 2. Further, 
(23)[v2, #1, es] = [ws, #1, v2], (23)[v3, 21, 22] = [v2, v1, vs], x(238) = 9, 
(123)["2, v1, @3] = [v3, @2, 01] = —[v2, 21, e3] + [w3, @1, v9], 
(123) [#3, #1, v2] = —[@e2, 1, e3], x(123) = —-1. 
Hence y = X(2,1). Another possibility is to assume that 


xX = m(3)x3) + m(2, Dx2,1) + MT?) x18) 


for some nonnegative integers m(3), m(2, 1) and m(13). Using the character 
table of Ss we obtain the following linear system with unknowns m(3), m(2, 1) 
and m(13): 

x(1) = 1-m(3) +. 2+ m(2,1) + 1-m(1?) = 2 

x(12) = 1- m(3) + 0- m(2,1) — 1-m(1?) = 0 


x (123) = 1-m(3) — 1-m(2,1) +1-m(1?) = -1. 





The only solution of the system is 
m(3) = 0, m(2, 1) = 1, m(1*) = 0, 
and this gives again that PL3 = M(2,1). 


Method 3. Let 1 : KS3 —> P3 be the S3-module isomorphism in Exercise 
12.3.1. We make use of Exercise 12.2.10 and use its notation. For every S3- 
submodule M (2, 1) of P3 we may choose a representative which is equal either 
to e(v; + awe) for some a € K or to (we), where 


é(v1) = o(1 + (12) — (13) — (182)) = a1 @or3 + vor 83 — @3%2r1 — 13212, 


t(we) = 4((123) + (23) — (18) — (182)) = wongay + ay agr2 — gto) — 13% Xo. 


We are looking for a € K, such that «(v1 + aw2) € PL3. This gives the 
condition 








€1Lo9tg t+ €oe 1 e3—U3qe,—X3H C+ 0(e9%3e] +41 e3t2—U3X9X|—X3%1L2) = 


= €[%2, 21, ¢3] + lez, 21, €2],€,n EK. 











Since 

[xo, v1, x3] = £20 1X3 — €1 LQe3Z — U3X9L1 + UZX1L2, 

[x3, v1, x2] = €3t LQ — Li L3ZkQ — LOX Ze, + CQL Ls, 
comparing the coefficients of x;x;x;,, we obtain that a = —2, € = —-1, 7 = 2, 
le. 


(v1) — 2e(we) = —[we, #1, 3] + 2[a3, €1, v2] 


and this means that PLs contains a submodule M(2, 1). Since 
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dimPL3 = dimM (2, 1) = 2, 
we obtain that PL3 = M(2,1). 


Exercise 12.3.3 Determine the S4-module structure of the sets of the mul- 
tilinear Lie elements PL, and of the proper multilinear elements I’, in K(X). 


Hint. (i) Use the basis of PL, consisting of the following elements 
[r2, 01,03, xa], [v3, v1, 02, xa], [4, v1, 02, x3], 


[[x2, x1], [v4, vs], [lzs, x1], [v4, v]}, [[x4, x1], [v3, xo], 


and calculate the character y of PL4. Use the character table of S4 (Exercise 
12.2.11 and Table 12.4) to decompose x as a sum of irreducible characters. 
The final result is 

PLy = M(3,1)@ M(2,1’). 


(ii) Use the basis of I consisting of 
[wj,21,09,...,€;,..., ea], = 2,3, 4, 
[vi,vy][ex, ri], i> j,k >l, 


(see Theorem 5.2.1). It is better to simplify the calculations in the following 
way. Consider the factor module I'4/PL4. It has a basis 


{[x, 21] 0 [v4, ¢3], [x3, 21]  [e4, 29], [v4, 21] © [x3, val}, 


where uo v = uv + vu. Calculate its character and show that I'4/PL4 = 
M (2?) 6 M(1*). Derive from here that 


Ty = M(3,1)6 M(2”) 6 M(2,1°) 6 M(1*). 


Exercise 12.3.4 Determine the Ss-module structure of the sets of the mul- 
tilinear Lie elements PLs and of the proper multilinear elements I's in K(X). 


Hint. Use the character table of S; in Remark 12.2.14 and Table 12.5. The 


answer 1s 
PLs = M(4,1) ® M(3,2) 6 M(3, 1") @ M(2?,1) 6 M(2, 19), 
T's = M(4,1) 6 2M(8, 2) 6 2M(3, 1°) @ 2M (2, 1) @2M(2, 1°). 


When we consider the applications of representation theory of the general 
linear group we shall give another method for decomposing the Lie and the 
proper multilinear polynomials of small degree. 


Remark 12.3.5 There are several descriptions of the S,-module structure 
of the set of multilinear Lie elements PL,. In the paper of Klyachko [150] 
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this description is given in the language of the so called induced characters 
and in the paper of Thrall [251] in the language of plethysms. See also the 
book by Bahturin [21]. 


Below we define the cocharacter sequence of a Pl-algebra or of the cor- 
responding T-ideal. This is one of the most important invariants of the PI- 
algebra which carries almost all necessary quantitative information about its 
polynomial identities. 


Definition 12.3.6 Let R be a Pl-algebra and let P,(R) = Py/(T(R)N 
P,), n= 0,1,2,...The S,-character 


Xn(R) = XP, (R) = S- m(R)xx 
AEn 
is called the n-cocharacter of the polynomial identities of the algebra R. The 
sequence 


Xn(R), n= 0,1,2,..., 


is called the cocharacter sequence of R. 


Since the n-th codimension c,(R) of the Pl-algebra R is equal to the 
dimension of the S,-module P,(R), we obtain immediately that c,(R) is 
equal to the evaluation of y,() on the identity permutation. 


Exercise 12.3.7 Show that for any (unitary) commutative algebra R 
Xn(R) = Xin), P= 0,1,2,..., 


i.e. the cocharacter sequence of R consists of trivial characters only. 


Hint. Since the T-ideal of R coincides with the commutator ideal of K(X), 
the relatively free algebra F(R) is isomorphic to the polynomial algebra K LX] 
in infinitely many commuting variables. Hence P,, (2) is spanned on the mono- 
mial x, ...x, and 


O(%1...8@p) = &1...%n, 7 © Sh, 


i.e. P,(R) is the trivial module of S,. 


Now we shall give the Regev proof [223] of the Amitsur theorem [10] that 
every Pl-algebra satisfies some power of a standard identity. The original 
proof of Amitsur is based on his theorem that the T-ideals T(M),(4)) are the 
only prime T-ideals (see Remark 5.2.2) and the fact that the Jacobson radical 
of any PI-algebra is nil (which is a much easier result than the Razmyslov- 
Kemer-Braun theorem for the nilpotency of the radical of a finitely generated 
Pl-algebra). The proof of the theorem of Amitsur can be found e.g. in the 
book of Rowen [231]. 
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As in Theorem 12.2.7, for a partition A of n and a standard A-tableau T, 
we denote by R(T) and C'(T), respectively, the row and column stabilizers of 
T, and assume that 


T)= SS) SO (signy)pr. 
pER(T) yEC(T) 
We also denote by ¢ the S,,-module isomorphism in Exercise 12.3.1 
tu: KS, —> Py. 
Lemma 12.3.8 Let \ = (m*) be a partition of n = km in k equal parts and 
let the first column of the standard \-tableau T be filled in with 1,2,...,k, 
the second column with k+1,...,2k, ..., the m-th column with (m— Ik + 


1,..., km. Let 
f(@1,..-, km) = e(e(T)) 


be the multilinear polynomial of Perm corresponding to e(T). Then the homo- 
geneous polynomial 


w(@1,...,€k) = f(@1,...,€k, €1,-. 5 eye. E1,--- 5 eh) 
obtained by identifying the variables with indices in the same row of T, 1.e. 
Uji = epi =--- = L(m—1)egi, t= l...,k, 


is equal to (m!)*s'(a1,..., 2%), where s, is the standard polynomial of degree 


k, 


Proof. The column stabilizer of T' is 
C(T) = Sk x 1.2. xX Sp, 


where the j-th copy of S;, acts on {(j —1)k+4+1,..., 7k}. Hence 


t{ So Gigny)y] = 
yeEC(P) 
= 8k (@1,---,@k)Sk(@et1,---, Cae) --- 8k (Um—1)k415 +++) Bmk)- 


Since 4 is an S,-module isomorphism, 


ue(T)) =e So p So (signyy} = SO pe | SO (signa 
pER(C) yEC(P) pEeR(C) yEC(T) 
The row stabilizer R(T) is a direct product of k copies s®) of symmetric 
groups, the group s!) acting on {#,k +%,...,(m—1)k +7}. Hence, for fixed 
1, the variables 2;, &4i,.--,%(m—1)k4i are Symmetric in 
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S- pe S- (signy)y | , 


nes) yEC(T) 
and the substitution 
Uji = epi =--- = L(m—1)egi, t= l...,k, 
gives 
k 
w(@1,...,¢k) = (m!)"s0 (a1, ..., ee). 


Example 12.3.9 If k =m = 2, then T is given in Fig. 12.8 and 


Fig. 12.8. The tableau T for m =k = 2 


t{ S© Gsigny) | = (= (12))(1 = (84) - eiameesag = 
yeEC(T) 


= (x a2 _ €o%1)(L3h4 _ £403) = 89(a1, t2)8o(X3, ta), 


f(e1, %2, &3,t4) = e(e(L)) = S- p+ 89(%1, €)82 (#3, 4) = 
peR(T) 


= (1+ (13))(1 + (24) + 82(@1, &2)82 (#3, e4) = 
= 89(#1, €2)82(@3, 4) + $9(%3, ¥2)52(a1, &4)+ 
+89(#1, €4)89(%3, £2) + $9(%3, &4)S82 (41, £2), 


f(€1, 42, r1, £2) = (21)? 83 (a1, &2). 


Theorem 12.3.10 (Amitsur [10]) Every Pl-algebra satisfies the polynomial 
identity 


Sh (@1,..., eR) = ( S- (signa) ao(1)-. 0] =0 


oeS, 
for some k,m> 1. 
Proof. We give the proof of Regev [223]. Let the Pl-algebra R satisfy a poly- 


nomial identity of degree d. By Regev Codimension Theorem 8.1.7, the codi- 
mension sequence of F satisfies the inequality 
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en(R) < (d—1)*", n=0,1,2,... 


The idea of the proof is the following. We shall find & and m such that 
the dimension d) of the irreducible S;,,-module M(A) corresponding to the 
partition \ = (m*) is bigger than cpm(R). This implies that M(A) is not 
a submodule of the Sjm-module Pym(R). By Maschke Theorem 12.1.6, the 
following Szm-module isomorphism holds: 


Since M(A) is irreducible and participates in the decomposition of Pym, we 
obtain that all submodules of Pym isomorphic to M(A) belong to PkmOAT(R). 
In particular, the element ¢(e(T')) also belongs to Pkm MT(R), where T is 
the A-tableau considered in Lemma 12.3.8. This gives that s? € T(R), Le. 
s,’ = 0 is a polynomial identity for R. 

We fix m € N such that m > 2(d—1)? and allow k > m. We shall apply the 
hook formula in Theorem 12.2.12 (ii) for the dimension of the $;,-module 
M(m*). Since each row of the diagram [A] = [m*] has m boxes and each 
column has k boxes, we obtain that the length of the (7, j)-th hook of [A] is 
m+k—t—j+1 and 


(m — 1)!(m — 2)!...1!0! (km)! 


1. = (bm) on ktm DE (REI (Rm 





We use the Stirling formula for n! 


1 


nt = Vornn"e"e9™) , |6(n)| < —, 
12n 


or, for sufficiently large n, 
nie V2arnn"e””. 


We consider & sufficiently large and make @ in the Stirling formula close to 
0. Since m is fixed, we obtain 


(km)! 7 V2rkm(km)k™e—k™ 


> Tee myn © ( In(k + m)(k-+ my btm e= (hem) 


We have assumed that k > m. Hence V27km > 1,k +m < 2k and 


2 
kE™mk™em 


> gmazm/2pm/2E(k+m)mga(k+m)m 


_ a (zy a (my _ em 
~ pmetm/2 \9 > kam? \9 ,a= Qmetm _~m/2° 


Since m > 2(d— 1)? and cym < (d— ym, we obtain 


dy 
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lim ——~ = 
k-+00 Ckm(R) 


and, for & sufficiently large, 
dy > Chm(R). 


As we have already seen, this implies that 57’? = 0 is a polynomial identity 


for R. 


Remark 12.3.11 (i) The original proof of Theorem 12.3.10 given by Amitsur 
[10] does not provide estimates for the values of & and m. In the proof of Regev 
presented above, one can obtain some bounds for & and m. For better bounds 
see the paper by Regev [223]. 

(ii) Since the degrees of the irreducible S;,,-representations corresponding 
to the conjugate partitions m* and k” are equal, one can obtain that if s” = 0 
is a polynomial identity for the algebra R, where k and m are obtained from 
the proof of Regev, then s*, = 0 is also a polynomial identity for R. 


Now we state without proof the theorem of Amitsur and Regev [12] about 
the partitions and the shapes of the Young diagrams corresponding to the 
irreducible characters in the cocharacter sequence of any Pl-algebra. The 
proof is based on estimates similar to those in Theorem 12.3.10 and work 
with the elements e(7') generating irreducible S,-modules. I think that the 
reader is prepared to follow the proof in the original paper of Amitsur and 
Regev. 


Theorem 12.3.12 (Amitsur-Regev [12]) For every Pl-algebra R there exist 
nonnegative integers k and l such that in the cocharacter sequence of R 


xn(R) = Som (R)xy, 2 =0,1,2,..., 
AFn 


the partitions \ = (Ay,..., Am) corresponding to nonzero multiplicities my (R) 
satisfy the condition Apyi < lL. In other words, their diagrams [A] are in a 
hook with height k of the arm and wide l of the leg, see Fig. 12.9. 


If we know the size of the hook of Theorem 12.3.12, we can estimate better 
the asymptotic behaviour of the codimension sequences of the Pl-algebra. For 
example, one can show that if in the notation of Theorem 12.3.12 Ax41 < 
for all A with m)(R) # 0, then 


limsup Ven(R) <k +1. 


noo 


This result follows from the theorem of Berele and Regev [32] that the multi- 
plicities m,(R) are bounded by a polynomial of n = |A| and estimates similar 
to these in the proof of Theorem 12.3.10. Hooks which provide more detailed 
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Fig. 12.9 The diagrams [A] with m)(R) 4 0 are in a hook 


information about the nonzero multiplicities in the cocharacter sequence of 
the PI-algebra are studied by Popov [210]. 


12.4 The Action of the General Linear Group 


In this section we survey the information on representation theory of the 
general linear group in a form which we need for our study of Pl-algebras. 
We restrict most of our considerations to the case when GL, (A) acts on the 
free associative algebra of rank m. The main application of representation 
theory of GL,(K) in this section is the theorem of Berele and Drensky. Freely 
restated, it gives that any result on multilinear polynomial identities obtained 
in the language of representations of the symmetric group is equivalent to a 
corresponding result on homogeneous polynomial identities obtained in the 
language of representations of the general linear group. 


Definition 12.4.1 Let ¢ be a finite dimensional representation of the gen- 
eral linear group GL»(K), i.e. 6: Ghp(K) —> GL,(K) for some s. The 
representation ¢ is polynomial if the entries (¢(g))pq of the s x s matrix ¢(g) 
are polynomials of the entries ay; of g for g € GLhm(K), k,l = 1,...,m, 
p,qg=1,...,s. The polynomial representation ¢ 1s homogeneous of degree d if 
the polynomials (¢(g))p~ are homogeneous of degree d. The GL,, ()-module 
W is called polynomial if the corresponding representation is polynomial. Sim- 
ilarly one introduces homogeneous polynomial modules. 


As in Chapter 10, we fix the vector space V,, with basis {@1,...,@,} and 
with the canonical action of GL»,(K). We also assume that 
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K(Vn) = K(a1,..., 2m) 


is the free associative algebra of rank m. 
The following exercise introduces the action of GL,,(K) which we shall 
use till the end of the chapter (compare it with Exercise 12.3.1). 


Exercise 12.4.2 Extend the action on Vj, of GL», () diagonally on the free 
associative algebra A (Vin) by 


g(a, ..-%1,) = gle@i,).-.9(t,), 9 € GIm(K), a, ...01, E KV). 


(i) Show that K(V,,) is a left GL, (K)-module which is a direct sum of 
its submodules (K(V;,)), n = 0,1,2,..., where (K(Vmn))™ is the homoge- 
neous component of degree n of A (Vin). 

(ii) Show that for every T-ideal U of K(X), the vector spaces UM K (Vin) 
and UN (K (Vin) are submodules of K (Vi). 

(iii) Show that every submodule W of K(V,,) is a direct sum of its ho- 
mogeneous components WM (K(Vin))@). 


Hint. (i) Show that the action of GL, (4) is a module action and (K (Vin) 
is GL,,(K)-invariant. 

(ii) Use that if f(a1,...,@m) belongs to the T-ideal U and g € GL», (K), 
then 

g(f(t1,-.-,%m)) = F(g(@1),---,9(4m)) EU, 

and UN K(Vin) is GL», (4K )-invariant. For UN (K (Vin) use that the T- 
ideals of K(X) are homogeneous ideals. 

(iii) If f, is the homogeneous component of degree n of the polynomial 
f in W, show that the action of the scalar matrix ae, 0 4 a € K, e being 
the identity matrix in GL,,(/), multiplies f, by a”. Apply Vandermonde 
arguments to show the statement for the homogeneous components. 


The polynomial representations of GL,,(4) have many properties similar 
to those of the representations of finite groups. 


Theorem 12.4.3 (i) Every polynomial representation of GLm(K) is a direct 
sum of irreducible homogeneous polynomial subrepresentations. 

(ii) Every irreducible homogeneous polynomial GL», (K)-module of degree 
n > 0 is isomorphic to a submodule of (K(Vn))™. 


The irreducible homogeneous polynomial representations of degree n of 
GLm(K) are described by partitions of n in not more than m parts and 
Young diagrams. 


Theorem 12.4.4 (i) The pairwise nonisomorphic irreducible homogeneous 
polynomial GL», (K)-representations of degree n > 0 are in 1-1 correspon- 
dence with the partitions X = (Ai,...,Am) of n. We denote by Wy,(A) the 
irreducible GLm(K)-module related to X. 
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(ii) Let A = (Ay,...,Am) be a partition of n. The GLy,(K)-module 
Wr (A) is isomorphic to a submodule of (K (Vin), The Gh »(K)-module 
(K(Vin))) has a decomposition 


(K(Vimn))™ & So d\Win (A), 


where dy is the dimension of the irreducible S,-module M(A) and the sum- 
mation runs on all partitions X of n in not more than m parts. 

(iii) As a subspace of (K(Vin)), the vector space W,(A) is multihomo- 
geneous. The dimension of its multthomogeneous component Wrenn) 18 


equal to the number of semistandard \-tableaux of content (ny,...,%m). 
(iv) The Hilbert series 


Hilb(Wmm (A), t1,--.5tm) = $2 dimWPrror mets thm 


is a symmetric polynomial and is equal to the quotient 


D(A +m 1,Ag+tm 2,...,Am—1 + 1, Am) 
Dim —1,m-—2,...,1,0) , 





where 7 7 In 
th ts J. RT tha 
ty? tw. th 
D(p1,- ++; bm) = : : . : : 
ett thm tit 
um tem tm thm 








The function 
Sy (ti,.--,tm) = Hilb(Win (A), t1,.--5tm) 


is called the Schur function corresponding to X. 


Example 12.4.5 Let m = n = 3 and let A = (2,1). The semistandard 
(2, 1)-tableaux of content (n1,n2,n3) for ny + no + ng = 38, 2) > ne > 
ng, are given in Fig. 12.10. Hence there are two semistandard tableaux of 
content (1,1,1) and one tableau of content (2,1,0). Since S/2,1)(t1, t2, t3) is 
a symmetric polynomial, we obtain that 


S(o,1) (ti, te, t3) = 2tytets + tite + tit3+ 





tty + ty + tt + toth. 


Using the determinant formula, we obtain 
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(1, N2, n3) = (1,1, 1) 


(m1, ne, n3) = (2, 1, 0) 


Fig. 12.10. The semistandard (2, 1)-tableaux of content (1, 1,1) and (2, 1,0) 











tH ty ts tty 3 
D(24+2,14+1,0+0)=|t 3), D(2,1,0)=]t te ts}, 
1 1 1 1 1 1 
D(24+2,14+1,0+0) 
S21) (t1, ta, t3) = —p@,1,0) = 
t? — t3)(t7 — #3) (6 - 3 
= ( 1 3)( 1 3)( 2 3) = (ty to) (ty ts) (to ts) 
(t1 — ta) (ti — ts)(t2 — ts) 
which gives the same answer. 
Exercise 12.4.6 Show that 
Stny(t1,---5tm) = hn (tay. --5tm) = Sept. na to +m = 0, 


is the complete symmetric polynomial of degree n, 


Sany(ti,...,tm) = en(ti,-.-,tm) = Seti .- ti, LS i <...<in <m, 


is the elementary symmetric polynomial of degree n < m. 


Hint. The semistandard (n)-tableaux are in 1-1 correspondence with the 
monomials of degree n in t),...,¢m. For A = (1") the correspondence is 
with the monomials of degree < 1 in each variable. 


Theorem 12.4.7 Let W = S>m,W,,(A) be a polynomial GL (K)-module, 
Wir(A) C K(Vn). Then the Hilbert series Hilb(W,t1,...,tm) determines W 


up to isomorphism. 


Corollary 12.4.8 Let Fy,(R) be the relatively free algebra of the variety 
generated by a Pl-algebra R. The Hilbert series Hilb(Fim(R),t1,...,tm) is a 
formal series of Schur functions and if 
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Hilb(Fyn(R),t1,..-,t = Di bale )Sx(t1,-..,tm), A= (At,---; Am); 


then 


R)= dh W, 


i.e. the Hilbert series of F(R) determines the GL»(K)-module structure of 
Fy,(R). 


Proof. By Exercise 12.4.2 (ii) and Theorem 12.4.3, F(R) is a direct sum 
of (maybe infinitely many) irreducible polynomial G,,(A)-modules Wy, (A), 
A = (A1,..-,Am). Now the proof follows from Theorem 12.4.7 applied to 
FLOR). 


Exercise 12.4.9 Let W be a GL,,(K)-submodule of K(V,,) and let 


Dm = {ate = d(E1, : ..5€m) = So Geni € Ginx) 


i=l 


be the diagonal subgroup of GL,,(K). Show that the Hilbert series of W 
plays the role of the character of D,, and 


Hilb(W, €it, ...,Emt) = S© try (d(€))e”, 
m>o0 


where tryy(») (d(&)) is the trace of d(€) acting on the homogeneous component 
W) of degree n of W. 


Hint. Use that W is a multigraded vector subspace of A (Vi) and the mul- 
tihomogeneous polynomials are the eigenvectors of d(€). For a multihomoge- 
neous polynomial f(x1,...,@m) of degree n; in ;, the action of d(&) gives 


d(E)\f(e1,...,@m) = Ef. ER F(a, ..., Bm). 


Exercise 12.4.10 (i) Let BY, m > 1, be the subspace of all homogeneous 
proper polynomials of degree n in K(Vi,). Show that Be isa Glm(K)- 


submodule of K(V,,). Find the decomposition of BY) into a sum of irre- 
ducible submodules for n = 2,3,4,5. 

(ii) Find the decomposition of the vector space of homogeneous Lie poly- 
nomials of degree n = 2,3,4,5in K(Vin), m> 1. 


Sketch of Solution. (i) Since the elements of BY) are linear combinations of 
products of commutators, it is easy to see that BY) is GL» (A )-invariant. 


As graded vector spaces BO and Be) have respectively bases 
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{[vi,2j] |¢ >a}, (lei, aj, ee] |2> 7 < Kf. 


We define a correspondence between the basis elements [a;,2;], 7 > j, of 


BY) and the semistandard (1°)-tableaux with column filled in with j,7. By 
Theorem 12.4.4 (iii) 


Hilb(B),t1,...,tm) = Sqa2)(t1,---, tm) 


). For n = 3 we relate to Le aj i> 


and Be ~ Wrl(l? j 
1)-tableau in Fig. 12.11, which gives Be) = Wr (2, 


< k the 
semistandard (2, 1 


). 


Fig.12.11. The (2, 1)-tableau related to [xi,2;, 2%], i> j <k 


(4) 


For n = 4 we use the basis of By 
{[vi,ej, ene) |t>7ck<UU {lei z][ee, ei] [i> 7, k > Uf. 


As for n = 2 and n = 3, the Hilbert series of the subspace spanned by 
[e;,%;,¢,,2:],0 > 7<k <1, (which is not a GL,,(K)-submodule!) is equal 
to the Schur function S)(t1,...,¢m) for A = (3, 1) (prove it!). Let 


W = span{[x;,2,][@x, 2] |i > Jj, k > Uf. 


Since W is a homogeneous polynomial GL,,(4)-module of degree 4, its 
Hilbert series 


Hilb(W) = Hilb(W, t1,...,tm) = $2 R(A)S) (th, «<5 tm): 
AFA 


is asymmetric polynomial of degree 4. It is sufficient to consider semistandard 
A-tableaux of content 


(1, 2, 3,4) = (M1, N2,N3,74,0,...,0), m1 > ne > ng > ng. 


For (n1, 22,3, M4) = (4, 0,0, 0) we obtain that there is only one semistandard 
tableau and it is for \ = (4). Hence 


0 = Hilb(W, t1,0,...,0) = &(4), k(4) = 0. 


Similarly, for (m1, n2,n3,n4) = (3,1,0,0), the homogeneous component of 
W is equal to 0. The only semistandard tableau for A # (4), ie. for \ = 
(3, 1), (27), (2, 1°), (14), is given in Fig. 12.12 and k(3,1) = 0. 
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Fig. 12.12. The only semistandard -tableau of content (3, 1,0,0) for ’ 4 (4) 
(2, 2, 0, 0) : [x2, x 1|[r2, r1] 
(2, 1, 1, 0) : [x2, «1][%3, x1], [@3, x1][%2, r1] 


(1, 1, 1, 1): [wa, x1] [x4, esl, [as, x1] [x4, Lol, [aa, x1 \[x3, Lol, 





[v4, xg][ae, €1], ee eee 


314) Bo 





Fig. 12.13. Basis vectors of W and the corresponding tableaux 


Counting the basis vectors of W of degree (2, 2, 0,0) and the semistandard 
A-tableaux of content (2,2,0,0), and similarly for (2,1,1,0) and (1, 1,1, 1), 
we obtain respectively the polynomials and tableaux in Fig. 12.13. In this 
way we obtain the linear system 


1 = k(27)-14 k(2,17) -0+ k(1*) -0 
2 = k(27)-1+k(2,1°)-1+ k(1*) -0 
6 = k(2°)-2+4h(2,1°)-34 k(14)-1 


and its only solution is 








k(27) = k(2, 1°) =k(1*) = 1. 


Hence 


Hilb(W) = S(22) + S(2,12) + S(14); 
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Hilb(BO) = Si3.4) + Hilb(W), 
BY) = Wr (3,1) ® Win (27) ® Win (2, 1?) @ Win (1*). 
Similar calculations give that 
BO) = Wn (4, 1) @ 2Wm (3,2) 6 2Wn(3, 1°) QW (27, 1) 6 2Wn (2, 1°). 
(ii) The calculations are as in (i). The answer is 
LD?) = BO) = w,,(1”), L@) = BS) = w(2,1), 


m m 


m 


L®) & Win (4,1) © Win (3,2) ® Win (3, 17) 6 Win (27, 1) © Win (2, 1°). 


LY! & W(3, 1) 6 Wn (2, 1), 


There is a close relation between the irreducible polynomial representa- 
tions of GL, (4) and the irreducible representations of the symmetric group 
Sp. In order to state an analogue of Theorem 12.2.7, we introduce a right 
action of S, on (K (Vin by 


(ai, .. tj, )o7* = Bigg + Ciggys Vip ++ Vi, E (K(Vin))™, TE Sp. 


Pay attention that the left action of S, on P, (Exercise 12.3.1) is an action 
on the variables and now S;,, acts on the positions of the variables. 


Exercise 12.4.11 Show that (K (Vin) is a right S,-module under the 
above action. 


Hint. Show that (fo)r = f(or) for f € (K(Vn,))™ and o,7 € Sp. Pay 
attention that we need o~! in the definition of the right S,,-action because 
we consider the permutations as functions, i.e. in or we first apply 7 and 
then oc. 


Let A = (Ai,...,Am) be a partition of n in not more than m parts and 
let q1,..., qm be the lengths of the columns of the diagram [A] (i.e. k = Ay 
and q; = X';). We denote by 5, = s,(r1,...,%q@), ¢ = u, the polynomial of 
K(Vn) 


k 
8 (@1,...,%q4) = II Sq; (1, ++, &q;); 
j=l 
where s,(%1,...,2p) is the standard polynomial. 


Theorem 12.4.12 Let X = (Ai,...,Am) be a partition of n in not more 
than m parts and let (K(Vm))( be the homogeneous component of degree n 
in K (Vin). 

(i) The element s)(x1,...,%q), defined above, generates an irreducible 


Glm(K)-submodule of (K(Vn))™ isomorphic to Wm(A). 
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(ii) Every Wy, (A) C (K(Vin))™ as generated by a nonzero element 


wy(@1,...,€¢) = $a(@1,..., 4) S- gO, A EK. 
TES y 


The element w(a1,...,&q) ts called the highest weight vector of W,,(A). It is 
unique up to a multiplicative constant and 1s contained in the one-dimensional 
vector space of the multthomogeneous elements of degree (A1,...,Am) in 
Wr (A). 

(iii) If the GLm(K)-submodules W! and W" of (K(Vm))™ are isomorphic 
to Wy,(A) and have highest weight vectors w' and w", respectively, then the 
mapping dq, :w' 3 aw",0#a € K, can be uniquely extended to a GLm(K)- 
module isomorphism. Every isomorphism W' = W" is obtained in this way. 


Example 12.4.13 (Compare with Exercise 12.2.10.) Let m > 2 and let 
A = (2,1). Applying Theorem 12.4.4 (ii), we use that di2,1) = dimM (2, 1) = 2 
and obtain that W,,(2,1) participates in the decomposition of (K (V,))@) 
with multiplicity 2. The lengths of the columns of the Young diagram [2, 1] 


are equal, respectively, to 2 and 1. Hence 
re _ _ _ 
W = §(2,1) = 89 (a1, £2)81(21) = [@1, @o)ay = EEA, — LQL Ey] 


is the highest weight vector of a GL,,(K)-submodule W’ = W,,(2,1) of 
(K(Vin))®. For o = 07! = (13) 


w= §(2,1)F = C121 — Le e2 = —a1[%1, x2] 
is a highest weight vector of another submodule W” = W,,,(2, 1). Since the 
highest weight vectors in W’ and W” are unique up to multiplicative con- 
stants, and w’ and w” are linearly independent, we obtain that W/NW” = 0 
and W! @ W” is a direct summand of (K(V))@). All isomorphisms of the 
GLm(K)-modules W’ and W” are determined by the mapping 


dai w saw 0OFaEK. 


Since every submodule W C (K(Vin))&), WwW 
W’ @W", the highest weight vector of W is w 


W,,(2, 1), is contained in 
Ew’ + nw", (€,) # (0,0). 


By Theorem 12.4.4 (ii), the multiplicity of Wy (A) in (K(Vin))™, is equal 
to the dimension d, of the irreducible S,-module M(A), A = (Ai,...,Am) Fn. 
Hence every W = Wy, (A) C (K (Vin) is a submodule of the direct sum of 
dy isomorphic copies of W,,(A) and the problem is how to find the highest 
weight vectors of these d, modules. 

We fix a partition A = (\1,...,Am) of n. Let the columns of the Young 
diagram [A] be of length qi,...,¢%, & = A1. For a permutation o € S, we 
denote by T(c) the A-tableau such that the first column of T(c) is filled in 


Il I 
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consequently from top to bottom with the integers o(1),...,0(q1), the second 
column is filled in with o(q, + 1),...,o(¢1 + qa), ete. 


Proposition 12.4.14 Let X = (A1,...,Am) be a partition of n and let 
Wim (A) C (K (Vin). The highest weight vector w, of Wm(A) can be ex- 
pressed uniquely as a linear combination of the polynomials wo = s,o7', 


where the o’s are such that the d-tableaur T(o) are standard. 


Proof. By Theorem 12.2.12 (i), dy is equal to the number of standard »- 
tableaux. On the other hand, the homogeneous component of degree A = 
(A1,..-,Am) of each of the dy copies of W,, (A) is one-dimensional. Hence the 
A-homogeneous component of the direct sum is of dimension equal to d,. We 
shall establish the proposition, if we see that the polynomials w,, with T(c) 
standard, are linearly independent. We consider the lexicographic ordering 
on A(Vj,) assuming that a; >... > #m. If T(c) is a standard A-tableau, 
then its entries increase from top to bottom and from left to right. It is easy 
to see that the leading term of we is ay ...©2ra7! (prove it!). Hence the 
polynomials we have pairwise different leading terms for different T(c) and 
are linearly independent. 


Example 12.4.15 Let A = (2, 1), asin Example 12.4.13. The standard (2, 1)- 
tableaux are given in Fig. 12.14 and are obtained for a, = 1 and a2 = (23). 


Fig. 12.14. The standard (2, 1)-tableaux 


Hence 
We, = §(2,1) = P1e Qe, — V2x 1 L]1, 


Woy = §(2,1)(23) = wae — e201 21, 
and the leading terms of wo, and wz, are respectively equal to a1 %20%1 and 


LLL. 


Exercise 12.4.16 Let k > m, k > r, and let A= (Ai,...,4,) be a partition 
of n in r parts (i.e. A, 4 0). Let 


Win = K (Vin) OWe(A) C (K(V%q))™. 


Show that W,, is a GL»,(K)-sumbodule of A(V,,) such that W,, = 0 if 
r>mand Wy = Win(A) ifr <m. 
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Hint. Embed GL,,(K) into GL,(K) fixing the variables rm41,..., 4. Use 
that the Schur functions are expressed in the language of semistandard 
tableaux and play the role of characters of the irreducible GL,,()-modules. 
Show that 

Sy(ti,.--,;¢m,0,...,0) =0,k > m, 


(no semistandard tableaux of content (n1,...,%m,0,...,0)) and 


Sy (t1,.-.,;tm,0,...,0) = Sy(ti,...,tm), k<m. 


Exercise 12.4.17 If f; and fo are homogeneous polynomials of degree n in 
K({Vn), prove that the polynomial identity f2 = 0 is a consequence of f; = 0 
if and only if fo belongs to the GL,,()-module generated by f,. 


Hint. Let W be the GL,,(A)-module generated by f,. If fo belongs to W, 
then fs is a linear combination of fi(g(#1),...,g(@m)), 9g © Glm(K), and 
fo = 0 is a consequence of f; = 0. Now, let fo = 0 be a consequence of f; = 0. 
In order to show that fo € W, it is sufficient to see that fi(g(a1),...,g(@m)) € 
W for every matrix g = (a;;) in M,,(4) (not necessarily in GL,,(K)). One 
possible way is the following. The matrix g has a presentation g = gidqgz, 
where 91,92 € Glm(K) and 


d=(1,...,1,0,...,0) => ei 


w=1 


is a diagonal matrix. Extend the action of GL,(K) on K(V,,) to an action of 
Mr (K) by o(f) = f(g(a1),.--,9(@m)) for any g € Mn (AK), f(t, ..-,%m) € 
K(Vn). Choose a multihomogeneous basis of W. For any basis element w = 
w(@1,...,€m) either d(w) = w if w does not depend on rg41,...,2%m or 
d(w) = 0 otherwise. Hence d(w) € W for all polynomials w € W. Since 
W is a GL,»,(K)-module, we obtain go(fi) € W, hence d(go(fi)) € W and 
9(fi) = 91 (d(g2(fi))) € W. 


Proposition 12.4.18 Letm >n, AF n and let Wy(A) C K(Vn). The set 
M = Wry (A) OP, of all multilinear elements in Wy,(A) is an S,,-submodule 
of Pp, isomorphic to M(A). Every submodule M(A) of P, can be obtained in 
this way. 


Proof. Since the GLy,(K)-module W,,(A) is irreducible, Exercise 12.4.17 
gives that every two nonzero elements of W,,(A) are equivalent as polynomial 
identities. Hence the nonzero elements in M = W,,(A) 9 Pp, are also equiv- 
alent. The multilinear consequences of degree n of a multilinear polynomial 
identity f(#1,...,%n) belong to the S,-module generated by f (why?). If 7 
is a direct sum of more than one irreducible submodules M,,..., Mz, then 


228 12. The Method of Representation Theory 


the polynomials in Mj, are not consequences of those in M2. Hence M is ir- 
reducible. Let T be the A-tableau, obtained by filling in the boxes of the first 
column with 1,..., 91, the boxes of the second column with g;+1,...,¢1+42, 
etc. As in Lemma 12.3.8, we can see that the polynomial «(e(7)) defined 
there is equivalent to 8, (#1,...,@m). Hence, in this special case M is isomor- 
phic to M(A). Now we use that the symmetric group S, is embedded into 
GLm(K) (permuting the first n variables and acting identically on the other 
m —n variables). If W, and W2 are submodules of A(V;,), both isomorphic 
to Wi, (A), then GL, (A) acts “in the same way” on W, and W2. Hence Sp, 
as a subgroup of GL,,(/), acts “in the same way” on M, = W, MP, and 
Mz = Wo Py, and this implies that M, = Mz = M(A). More precisely, 
every GL,,(A)-module isomorphism ¢ : W, —> W, is also an isomorphism 
of multigraded vector spaces. Hence ¢ maps the multilinear elements of W 
onto the multilinear elements of Wo, 6(Md1) = M2 and the restriction of ¢ on 
M, is a vector space isomorphism of M, to Mz such ¢(o(f)) = o(¢(f)) for 
every f © M, and every o € Sy C Gl, (K). 


Remark 12.4.19 Comparing Exercise 12.2.10 and Example 12.4.13, we see 
that the highest weight vector of any irreducible GL,,(K)-submodule of 
K({Vm) isomorphic to W,, (2, 1) is of the form 


E[v1, co]ay + nxi[ei, eo], (0,0) 4 (E,7) € K?, 


and the expression of this element is “simpler” than the expression of the 
corresponding multilinear element in M(2,1) C Ps. In many cases, it is more 
convenient to use the representation theory of the general linear group instead 
of the representation theory of the symmetric group. The next theorem shows 
that it is very easy to translate the results from one of these two languages 
to the other. 


Theorem 12.4.20 (Berele [29], Drensky [71]) Let R be a Pl-algebra and let 
xn(R) = So my (R)xy, 2 =0,1,2,..., 
AFn 


be the cocharacter sequence of the T-ideal of R. Then, for any m, the relatively 
free algebra F,(R) ts isomorphic as a Gly,(K)-module to the direct sum 


dD mal(BWinQ), 


n>0Arn 


with the same multiplicities m)(R) as in the cocharacter sequence (assuming 
that W,,(A) = 0 if X is a partition in more than m parts). The Hilbert series 
of F(R) is 


Hilb(Fin(R),t1,---5tm) = S> S 2 my(R)Sx(t1,---5tm). 


n>0AFn 
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On the other hand, if m > n and 


FLY (R) = DY) m(B)Win(A), 


for some nx(R), then 


AEn 


Proof. Let k > n and let 


FLOR) = So ny(R)We(Q). 


By Exercise 12.4.16, for every m < k 
FLOR) = S7na(R)WinQ), 


where the summation runs on all partitions X = (A1,...,Am) F n. Hence, 
it is sufficient to show that P,(R) and FS? (R) have “the same” module 
structure for m > n (ie. the same multiplicities respectively of M(A) and 
Wy, (A)). The decomposition of the n-th cocharacter of T(R) is equivalent to 
the decomposition of P,(R) 


P,(R) = 5° m,(R)M()). 

AEn 
Since m > n and X is a partition of n, every irreducible submodule W,,,(A) 
of FSD (R) intersects with some irreducible submodule M(A) of P,(R). The 
direct sum of m)(R) copies of M(A) in P,(R) generates a GL, (A )-module 
which is a direct sum of m)(R) copies of W,,(A) in FSM (R). In is easy to see 
that this means 

FAO(R) = S72 my(R)W mn (A), 
AEn 


and this competes the proof. 


Remark 12.4.21 An analogue of Theorem 12.4.20 holds also for other va- 
rieties of linear algebras over a field K of characteristic 0 and in the more 
general case when we consider factor algebras of the absolutely free (nonas- 
sociative) algebra A{X} modulo ideals U invariant under substitutions of 
linear combinations of the variables, i.e. when f(#1,...,%n) € U implies 


k 


ky n 
fh Qj1%;, - So ain) EU, aE K. 
i=l 


i=l 
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Example 12.4.22 Theorem 12.4.20 and Remark 12.4.21 allow to determine 
the S,-module structure of PL, and I, for n < 4 (see Exercises 12.3.2 
and 12.3.3) as a direct consequence of Exercise 12.4.10. For example, the 
GL»(K)-module decomposition 


BY) = Wr (3, 1) © Wn (2?) @ Win (2, 17) 6 Wn (1*) 
is equivalent to the S4-module decomposition 
I'y = M(3,1) 6 M(2”) @ M(2,1°) 6 M(1*). 
Now we shall show how to decompose BY) using highest weight vectors. We 


calculate (using e.g. the hook formula of Theorem 12.32) the dimensions of 
the irreducible S;-modules 


dimM (4, 1) = dimM (2, 1°) = 4, 
dimM (3,2) = dimM (2?, 1) = 5, dimM(3, 1°) = 6. 


Since 


diml’s = 44 = dimM (4, 1) + 2dimM (3, 2)+ 
+2dimM (3, 1°) + 2dimM (2, 1) + 2dimM (2, 1°), 


it is sufficient to find in BY? a highest weight vector w4,1) and two linearly 
independent highest weight vectors wi, and w for every A = (3,2), (3, 1°), 
(27,1), (2,18). This would guarantee that the GL, (K)-module 


Win (4, 1) 6 2Win (3, 2) © 2Wm (3, 1°) @ 2Wyp, (27, 1) © ZW (2, 1°) 
(5) 


is a submodule of By’, hence 
M(4,1) @ 2M(3, 2) @ 2M (3, 1°) 6 2M (2?, 1) 6 2M(2, 1°) 


would be an S5-submodule of I’;. Since the dimension of this S;-submodule 
is equal to the dimension of I's, we obtain that it coincides with I’;. The 
element 

8(4,1) = $2(*1, vo)x? 


is a highest weight vector of (K(Vm))©. We define a GLm(K)-module ho- 
momorphism ¢ : (K(Vn))© —; BY by 


Qo: Li, ...Lig > [@i,,---, Lis] 
and obtain that 
9(s(4,1)) = War) = —2[22, 21, 41,01, 21] #0 


in BS). Hence Wr (4, 1) C BE). Similarly, for \ = (3, 2) we define GL», (K)- 
module homomorphisms ¢; : (K(Vin))©) — BE), i= 1,2, by 
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Pt 1 Bj, sig [vi Vio, Xi5|[Lis; ria] 


2 1 Bj, sig [vi £;,][2is, Cia, is] 


and the polynomials 


$1(8(3,2)) = 4[@2, #1, €1][w2, €1], G2(5(3,2)) = 4[ee2, vi] [e2, 11, 1] 
are linearly independent highest weight vectors. Hence 2W,,(3,2) C BE). 
The considerations for the other \’s are analogous. 
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In Section 4.3 we have established relations between the ordinary and the 
proper polynomial identities. It is naturally to expect that the information 
about the representations of S, and GL,,(K) related with the polynomial 
identities of a Pl-algebra R can be expressed in terms of the corresponding 
representations related with the proper identities. It turns out that such 
relations do exist and they require deeper results on representation theory 
of GL,,(K) than these used in Section 12.4. In particular, we need some 
information on tensor products of GL, (A )-modules. 


Exercise 12.5.1 Show that the tensor product W, @ W2 of two polynomial 
Glm(K)-modules W; and W, is a polynomial module again. 


Hint. Take bases {w; | 7 € I} and {w/ | j € J} of Wi and Wa, respectively. 
Use that for g € GL (K) 


Iwi) = YF apiwh, (wf) = SF Baie), 


pel qed 


where ap; and (4; are polynomials of the entries of g, implies that 


g(w; ® wi) = S- Opi Bg; (Wp @ wy): 


Derive from here that the entries of the matrix corresponding to the action 
of g on W; ®Ws (with respect to the basis {uj @w/ |7€ I,j € J}) are again 
polynomials of the entries of g. 


The decomposition into a sum of irreducible GL,, (A )-submodules of the 
tensor product W, ®@ W, of two irreducible polynomial G'L,,(K)-modules W, 
and W», can be obtained by the combinatorial Littlewood-Richardson rule 
(see e.g. the book by Macdonald [175]). There is a simplified version of this 
rule which gives the decomposition when one of the modules is isomorphic to 


Wr(n) or Win (1). 
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Theorem 12.5.2 (The Young Rule) The tensor products of the irreducible 
Glhm(K)-module Wm (A1,...,Am) with Win(q) and W,, (12) (in the latter case 


q<m) are decomposed into the following sums of irreducible components: 
Wr(A1, e ., Am) ® Wn(q) = Wia(Ar + Pi, e .,Am + Pm), 


where the summation is over all nonnegative integers p1,...,Pm such that 
pit...+pm=q and 44+ pi < Ay-1, 7= 2,...,m; 


Wm(A1,.--,Am) & Wr(1?) > Wia(Ar + €1,...,Am + £m), 


where the summation is over alle; = 0,1, such that ¢, +... +é€m = q and 
AG+ & < Ag—1 + &5-1,; 7=2,...,m. 


In other words, W,(A) © Win (gq) and Wi,(A) @ Wy, (12) are direct sums 
of pairwise nonisomorphic irreducible GL,,(K)-modules. The diagrams [;:] 
corresponding to the irreducible components of Wy, (A) ® Wm (¢g) are obtained 
from the diagram [A] by adding g boxes in such a way that no two new boxes 
are in the same column of [py]. For Win (A) @ Wy, (12) the new ¢ boxes are not 
allowed to be in the same row. When q = 1 this is the well known Branching 
Theorem. 


Example 12.5.3 Let \ = (2,1) and ¢g = 2. Then (see Fig. 12.15 and 12.16), 
for m > 4, 


Wm (2,1) @ Win (2) = Win (4, 1) ® Win (3, 2) © Win (3, 17) B Win (2?, 1), 





Fig. 12.15. The decomposition of Wi,(2,1) @ Wm(2) 


Wm (2,1) ® Wn (17) = Win (3, 2) @ Win (3, 1°) @ Win (27, 1) B Win (2, 1°). 


The following theorem gives the relationship between the ordinary and 
proper cocharacters of a Pl-algebra. As in Chapters 4 and 5, it allows to 
simplify the calculations. 
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Fig. 12.16. The decomposition of Wy(2,1) @ Win(1?) 


Theorem 12.5.4 (Drensky [74]) Let R be a unitary Pl-algebra and let 


xn(R) = So my (R)xy, 2 =0,1,2,..., 
AFn 


Np (R) = XPp(R) = So ky(R)xv, p= 0, 1, 2, sey 
vl p 


be, respectively, the ordinary and proper cocharacter sequences of the T-ideal 
T(R). Then the multiplicities m,(R) and k,(R) are related by 


ma(R) = > kv (R), 


where for X = (Ai,...,An) the summation runs on all partitions v = 
(¥1,...,Un) such that 


Ay >My > Ae > 9 >... > An > Un: 





Proof. By Theorem 4.3.12 (i), the Hilbert series of the relatively free algebra 
Fy (R) = K(a,...,@m)/(K(01,...,%m) A T(R)) 


and its proper elements B,,(R) are related by 





Hilb(Fin(R), t1,---,tm) = Hilb(Bm(R),t1,--.,tm) T] u 


The graded vector spaces with finite dimensional homogeneous components 
are determined up to isomorphism by their Hilbert series. Hence, as a multi- 
graded vector space, F,,(R) is isomorphic to the tensor product of By, (R) 
and the polynomial algebra K[V,,]. By Corollary 12.4.8, the Hilbert series 
of F,,(R) determines the GL,,(K)-module structure of F(R). Hence, as 
GL m(K)-modules, 
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F(R) & Bm(R) @ K[r1,..., 2m]. 


On the other hand, since the Hilbert series of the vector space of all homoge- 
neous polynomials of degree n is equal to the complete symmetric function 


lin(tiy-+5tm) = So tPA, my + +m = 1, 
Exercise 12.4.6 gives that as GL,»,(K)-modules 


K[e1,...,&m] = SS" Wn (n). 


n>o 


By the equivalence between representations of S, and GL»,(K) (Theorem 


12.4.20 and Remark 12.4.21), 


F(R) = $5 m(R)Wm(A) = S22 ky R) Win (v) ® Wn (p)- 


vo p>o 


Now we apply the Young rule of Theorem 12.5.2. The module W,,,(A) par- 
ticipates in the decomposition of W,,(v) ® Wi,(n) if and only if there are no 
boxes in the same columns in the set difference of Young diagrams [A] \ [v], 
i.e. if the lengths of the rows of [A] = [Ai,..., An] and [v] = [/1, ..., vn] satisfy 
the inequalities 

Ay >My > Ae > 9 >... > An > Un: 





Using again the equivalence between representations of S,, and GL»(A), we 
obtain the desired decomposition. 


Now we shall apply Theorem 12.5.4 to calculate the cocharacters of some 
important algebras already considered in the previous chapters. We start 
with the cocharacters of the Grassmann algebra. The original theorem is due 
to Krakowski and Regev [155] and is based on representation theory of the 
symmetric group. An alternative exposition can be found in the paper by 
Ananin and Kemer [14]. 


Theorem 12.5.5 (Krakowski and Regev [155]) Let E be the Grassmann 
algebra of an infinite dimensional vector space. Then the cocharacter sequence 
of the polynomial identities of E is 

n—-1 

xn(B) = So xin-n,1¥), 

k=0 
i.e. the irreducible S,-submodules of P,(E) are with multiplicity 1 and cor- 
respond to Young diagrams lying in a hook with height of the arm and wide 
of the leg equal to 1. 


Proof. We apply Theorem 5.1.2 (iii) (and its proof). The Hilbert series of 
Bm(E) is 
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Hilb(Bm(E),t1,-..,tm) = > ear(ti,.-..tm), 
k>0 


where €p(t1,...,¢m) 1s the elementary symmetric polynomial of degree p. By 
Exercise 12.4.6, 


Hilb(Bm(E),t1,---,tm) = $— Sao) (ta, ---5tm), 
k>0 
and By» (E) is a direct sum of the GL, (K)-modules W,,(17*), & = 0,1,2,... 
Hence, by Theorem 12.5.4, yn(/) is a sum of the irreducible $,-characters 
X,, where the diagram [A] is obtained from the diagram [v] = [1°*] by adding 
several boxes to the first row and, eventually, building a new row of one box 
(with odd number of the row). Hence \ = (a, 1°) n, where a+b=n. 


The next theorem describes the GL,,(A)-module structure of the proper 
elements of the relatively free algebra Fin (U,(K)) of the variety of associative 
algebras generated by the algebra of k x k upper triangular matrices. 


Theorem 12.5.6 (Drensky and Kasparian [91]) Let U,(K’) be the algebra of 
k x k upper triangular matrices. Then 


k-1 
Bm(Uk(K)) = SS Win (pt = 1,1) @...@ Win (pr = 1,1). 
r=0 p,>2 


Proof. First, let k = 2. By Theorem 5.2.1, BY?) (Us(K)), p > 0, has a basis 
[@i1, ig, i], ty > 19 < ig < Lees bp - 


We define a 1-1 correspondence between the basis elements of BY?) (Us(K)) 
and the semistandard (p — 1, 1)-tableaux by 


—, fe fis] Tip | 
[®i1,®ig,.--, i, P 


al 


Clearly, the image of a basis element of degree a = (a1,..., @m) is a semistan- 
dard tableau of the same content a. Hence the Hilbert series of BY) (Uo(K)) 
and the Schur function S(p_1,1)(t1,-..,tm) are equal. Bearing in mind that 


B\) (Us(K)) = K and the Hilbert series determines the GL,,(K)-module 
structure, we obtain that 


Bm(U2(K)) = K 6 S> Wn (p — 1,1). 


Applying again Theorem 5.2.1, we obtain that, as a graded vector space, 
Bm(Ux(K)), & > 2, is isomorphic to 
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Bm(Ur—1(K)) ® $2 Wp — 1,1) @...@ Wn (Pei — 1,1) 
pir2 


and, by induction, this completes the proof of the theorem. 


Remark 12.5.7 In order to obtain the cocharacter sequence of T(U;(K)), we 
need to know the decomposition of the tensor products Wy, (A)@ Wm (p—1, 1). 
For p = 2 we can apply the Young rule of Theorem 12.5.2. For p > 2 we need 
the Littlewood-Richardson rule. 


Exercise 12.5.8 Show, that as a GL,,(K)-module, By, C K (Vin) is isomor- 
phic to the tensor product 


S75 Wn(pi — 1,1) @...@ Win(Pr = 1,1). 


r>0 p.>2 


Hint. Apply Theorem 12.5.6. Use that the minimal degree of the polynomial 
identities of U;, (A) is equal to 2k and hence BY) > BY? (U,(K)) for 2k >n. 


Exercise 12.5.9 Using Example 12.5.3 and Exercise 12.5.8, decompose once 


again BY forn=4andn=5. 


We have paid so much attention to the decomposition of the GIy, (K)- 
module B,, of the proper polynomials in the free algebra K(V,,), because 


the knowledge of the decomposition of BS) for small n is very useful for the 
description of the polynomial identities of PI-algebras satisfying identities of 
low degree. It also helps to make conjectures for the cocharacters of T-ideals. 


Exercise 12.5.10 Compute the S,,-cocharacter of the 2 x 2 matrix algebra 
Mo2(K) for n= 5. 


Sketch of Solution. We use the decomposition of BY) for n <5, obtained in 
Exercise 12.4.10 and Example 12.4.22. Let m > 5. Since the only polynomial 
identity of degree < 4 for M2(K) is the standard identity s4, which generates 


the GL»(K)-submodule W,,(1*) of BY) we obtain that. BS) (M2(K)) = 
BY for n < 3, 
By) (M2(K)) = Wm(0) = K, BL) (M2(K)) = 0, 
By) (M2(K)) = Wn(1?), BY? (M2(K)) = Wn(2, 1), 
BY) (Mo(K)) = BY) /Wr(1*) = Wn (3,1) ® Win (2?) © Wn (2, 1°). 
Now we choose the following highest weight vectors of the irreducible com- 
ponents W,,(A) of BY). 
A = (4, 1): 
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W(4,1) = [@2,%1,%1, 01,01], 
A = (8, 2): 
w(3,2) = [[v2, v1, 21], [v2, ei], w(3,2) = [r2, 21,1] © [x2, v1], 
A = (3, 1°): 


W(3,12) = > (signe) [[vo(1), #1, £1], [%o(2), Zo(ay]], 
oESs 


wls.12) = So [toc 1, 01] © [o(2), to(syh 
oESs 
A = (27,1): 


W921) = > (signe) [[r2, 21, to(1)], [to(2), Foca), 
oESs 


Wi21) = S- [22,21, to(1)] © (toa), Posy) 
oESs 


A = (2,19): 


Wlo,13) = > (signe) [[to(1), £1, Lo(2)], [Vo(a), Loay]], 
oES4 


wlrs) =). (signe) [to(1), #1, ®o(2)] © [to(a), Po(ay]- 
oES4 


Replacing 2, by a diagonal generic 2 x 2 matrix and #2,...,@%m by arbitrary 
generic matrices we see that 


wa) #9, w(3,2) #0, w3 12) # 0, w(o21) #0 


in M2(K) and 


W,2) = W312) = W(o2,1) = Mears) = Vrs) = 9 


are polynomial identities for M2(K). Hence (why?) 
B\®) (Mo(K)) & Win (4,1) ® Win (3, 2) @ Win (3, 12) @ Wn (2, 1). 
Applying Theorem 12.5.4, we obtain that 
x5(M2(K’)) = x5) + 4x(4,1) + 4X (3,2) + 5x (3,12) + 4x (22,1) + X(2,18)- 
Exercise 12.5.11 (Partial case of a result of Popov [209]) Compute the 


Sp-cocharacter of the tensor square F © EF of the Grassmann algebra F for 
n=. 
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Hint. Use Exercise 2.1.6. In the notation of Exercise 12.5.10, show that 
the polynomials W312) W921) W(o,19) are consequences of the centre-by- 


metabelian identity. Prove that w/, 4) is also a polynomial identity for B® E. 


Show that neither the other highest weight vectors nor the highest weight 
vectors of BY vanish on # @ Ef. Hence 


BOE @ E)= BY n<, 


BY) (BE @ BE) = Wn(4,1) ® Wn(3, 2) © Win (3, 1?) ® Win (27, 1) ® Win (2, 1°). 
Finally, apply Theorem 12.5.4. 


Exercise 12.5.12 Prove the following theorem of Regev [224]. The algebra 
R satisfies the Capelli identity in & skew-symmetric variables if and only if 
the multiplicities m,(R) in the cocharacter sequence of R are equal to 0 for 


all A= (Aj,..-,An) with Ap £ 0. 


Hint. Use that for A, # 0, the highest weight vectors wa of Wn (A) C K(Vin) 
contain sums with > k skew-symmetric variables, i.e. the Capelli identity 
implies w, = 0 in R and m)(R) = 0. On the other hand, let all multiplicities 
m(R) be equal to 0 for A, 4 0. We generate a GL, (K)-submodule of K (Vin) 
by the Capelli polynomial. It is equal to 0 for m < k, and, by Exercise 12.4.16, 
for m > k, it decomposes as a direct sum of W,,(A) with A, 4 0. Hence the 
Capelli polynomial belongs to T(R). 


Exercise 12.5.13 Let R be a Pl-algebra and let dimR = p. Prove, that in 
the cocharacter sequence of the T-ideal T(R) 


xn(R) = Som (R)xy, 2 =0,1,2,..., 
AFn 
m(R) = 0, if Apgi #0 in X= (A1,---, An). 


Hint. Use that R satisfies the Capelli identity in p+ 1 skew-symmetric vari- 
ables and apply Exercise 12.5.12. 


Exercise 12.5.14 Let R be a unitary Pl-algebra with centre C’ and let 
dimR = p, dimC' = q. 
Prove, that in the ordinary and proper cocharacter sequences of the T-ideal 
T(R) 
xn(R) = Som (R)xy, 2 =0,1,2,..., 


AEn 


Mn (R) = XPn(R) = So ky(R)xv, n= 0, 1, 2, sey 


ven 
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m)(R) = ky(R) = 0, if Ap-geo A OM A= (A1,...,An) and Yyp—g41 #0 in 


v=(M,...,Un). 


Hint. For the proper cocharacters use that every proper polynomial with 
p—4q+1 skew-symmetries vanishes on R. For the ordinary cocharacters 
apply Theorem 12.5.4. 
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Since the relatively free algebra F,(@M;,(K)) is isomorphic to the algebra 
generated by m generic k x k matrices, invariant theory of matrices was 
successfully involved by Procesi [214] and Razmyslov [219] in the study of 
the T-ideal T(M;,(4)) of the polynomial identities of k x k matrices over 
a field of characteristic 0. A survey can be found in the book by Formanek 
[108]. 

Up till now the results on T(M,(K)), & > 2, are far from their final 
form and only the structure of the identities of the 2 x 2 matrices is well 
known. In a series of papers (see [226] for a survey) Regev obtained the 
asymptotic behaviour of c,(Mo(K)) and yn (Mo2(K)); the explicit expression 
of ¥n(Mo(K)) was found by Formanek [105] and the author [74]; Procesi 
[215] computed the codimension sequence of M2(K). Formanek, Halpin and 
Li [109] for m = 2 and Procesi [215], Formanek [105] and Drensky [74] in the 
general case computed the Hilbert series of Py,(M2(K)). 

One of the possible ways to study T(M2(K)) is via the generic trace 
algebra generated by m generic 2 x 2 matrices together with all the traces of 
the generic matrix algebra F,,(M2(A)) and the approach of Regev, Formanek 
and Procesi follows this way. For a survey on the 2 x 2 generic trace algebra 
see the book by Le Bruyn [169]. Till the end of the chapter we give as an 
application of the method of representation theory of GL,,() an exposition 
of the quantitative results for T(M2(AK)) without any invariant theory. The 
method works also for other PI-algebras and we refer to the survey [77] for 
further applications. 


Exercise 12.6.1 Let 
ay = €11 — €92, Go = 19 + €21, 43 = €12 — €21 © Mo(K) 


and let a;,...a;, be a product which contains n; times aj, nz times a2, ng 
times a3, m1 + no +ng =n. Let 6; = 0,1, e; =n; (mod 2), ¢ = 1,2,3. Show 
that 

aj, = taj asa;? 


n 





Gj, es 


Hint. Show that 
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a1 42 = —241 = 43, 4243 = —A302 = —@1, 4341 = —a14a3 = —G2, 


2_ 42 _ 2_ 
ay = a5 = €, a3 = —e. 


The equations of the first line give that we can rearrange the elements a;, 
and the equations of the second line give the desired result. 


The following theorem of Drensky [71, 78] gives a simple test verifying 
whether a highest weight vector in B,, is a polynomial identity for Mo(K). 


Theorem 12.6.2 Let wy(@1,...,%m), = (f,---, Hm), be a highest weight 
vector of the submodule W,,(j) of the GL»,(K)-module By, of the proper 
polynomials in K(Vm). If wa #0, then wy(21,...,2m) ts a polynomial iden- 
tity for Mo(K). When ps = 0, then wy (21, 2,23) is a polynomial identity 
for M2(K) if and only tf wz(a1, a2, 43) = 0, where the matrices a), a2, a3 are 


defined by 
_f1 0 fo 1 _f 0 1 
HAVO ape @ =a ofS \-1 o}° 


Proof. By Exercise 12.1.10 (i), the centre of Mz(A) consists of all scalar ma- 
trices and hence is one-dimensional. Since dimM2(A) = 4, Exercise 12.5.14 
gives that the GL,, (A )-module B,, (M2(A’)) is decomposed into a sum of irre- 
ducible GL,,(4)-modules W,p (#41, #2, #3) and this implies that w, € Wm (ps) 
is a polynomial identity for Mz(K) if 4 4 0. Now, let 4 = 0. By Exercise 
7.2.2, Wy(@1,€2,%3) € T(Mo(K)) if and only if wia(y,ye,y3) = 0 for the 
generic matrices y1, y2, y3. By Exercise 7.2.7 we may assume that y, is a di- 
agonal matrix and yo is symmetric. The matrices €, a1, @2, a3 form a basis of 
Mo(K). We rewrite y1, yo, y3 as their linear combinations: 





Yi = bo1€ + Ea, yo = bore + G12d1 + Nae, yz = Goze + b1341 + b23G2 + Cas, 


where 4;;,€,7,¢ are commuting variables. All elements of w,(y1, yo, y3) are 
in commutators only and we may assume that ¢@9); = 0. The polynomial 
w, (#1, £2, £3) 1s multihomogeneous of degree i; in x;. By the skew-symmetry 
of wy(#1,%2, 23) described in Theorem 12.4.12 (11), we obtain that 


wWyulyi, ¥2,Y¥3) = Wu(Ear, nas, Cas) = Enh? CM wy (a1, aa, as) 
and wy(yi, ye, ¥3) = 0 if and only if w,(a1, a2, 43) = 0. 
Remark 12.6.3 The matrices a), a2,a@3 in Theorem 12.6.2 can be replaced 


by the matrices 


1 1 1 
bj = 5 (eu €92)V 1, bg = ple + en)V—L, b3 = ple _ €91). 


The table of multiplication of these matrices is the following 
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b b b 
by be = —b2b, = > bobs = —bgbe = > b3b1 = —b1b3 = > 








[b1, b2] = b3, [b2, bs] = b1, [bs, bi] = bo, bj = 5 = b3 “7 
and this will simplify the computations in the next theorem which describes 
the GL,,(K)-module structure of the proper polynomials in Fj, (M2(K)). 


Theorem 12.6.4 (Drensky [71]) The GL,y,(K)-module of the proper polyno- 
mials in the relatively free algebra Fy,(Mo(K)) satisfies 


By (Mo(K)) = Brn [( Bmw VT (M(B) © > Win (115 fas Ha): 


where the summation is over all partitions p = (f1, Ha, M3) different from 
y= (13) and p=(n), n>1. 


Proof. Let 
Mo(K)) = So key(M2(K))Win (11). 


( 
By Theorem 12.6.2,’ ” o(K)) = 0, if w = (i,..., pm) and pg # 0. Now, let 
He = (p44, fo, Hs) and let w, (1,82, #3) and wii (#1, 22, 23) be highest weight 
vectors of two isomorphic GLy,(K “)-submodules Wry) of Br C K Wm ) 
and let wi, € Bn O T(Mo(Kx)). Then, by Exercise 12.6.1, (w), and wi are 
multihomogeneous), there exist constants a’ and a” in K such. that 


wi, (a1, ay, a3) = a’a}!as7a3°, wi (ar, a2, a3) = aa}! a57a3°, 
where ¢; = 0,1, ¢; = py (mod 2), i= 1,2,3. By Theorem 12.6.2, a’ #0. On 
the other hand, 


wy(£1, £9, t3) = aw), (#1, £9, t3) —_ a! wy (a1, £9, £3) E Bm 


vanishes for «; = a;, i = 1,2,3, and by Theorem 12.6.2 again, w, (#1, 2, #3) 
is a proper polynomial identity for Mo(). Therefore, w), and wi are linearly 
dependent modulo B,, 1 T(M2(K)) and k,(M2(K)) < 1. The proof will be 
completed if we show that k,(M2(K)) = 0 for # = (1°) and p= (n), n> 1, 
and if we construct nonzero highest weight vectors in By,(M2(K)) for all 
other partitions 4 = (ft1, 2, #3). The multiplicity of W,,(n) in K(V,) is 1 
and the corresponding highest weight vector is #7. Clearly 2? ¢ B,, forn > 1 
and, therefore, ki,)(M2(K)) = 0. Similarly, k(1s)(M2(A)) = 0 because the 
only submodule W,,(13) of K (Vm) is generated by the standard polynomial 
$3(@1,%2,%3) which does not belong to Bm. Now, let p = (11, fa, fg) be 
another partition. We assume that yet n > 1, since Wi, (ys) = A for pw = (0). 
Let 


Wy = 82(#1, €2)(adx,)"—"? 83 (ada, adxy, adag)"? so(x1, ay) Hest 


when po # pg and, for po = ps, 
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Wp = S> (signe) eo) (ade)! "83 (ade, aderz, ades)"*~'[25(2), €o(3)]- 
oESs 


The polynomial w, has the desired skew-symmetry and all variables are in 
commutators. Hence w, is a highest weight vector of Win (2) C Bm. Direct 
verifications show that w, (61, 62, 63) # 0 for the matrices 61, b2, b3 in Remark 
12.6.3. Therefore, wi ¢ Bm OT(M2(K)). 


Now we are able to give an explicit formula for the cocharacters of the 
T-ideal of the matrix algebra Mo(K). 


Theorem 12.6.5 (Formanek [105], Drensky [74]) The cocharacter sequence 
of the T-ideal T(M2(K)) ts 


Xn(Mo(K)) = Som (Mo(K))xa,n = 0,1,2,..., 
AFn 


where X = (Aj, A2,A3, Aa) and 
(i) Mn LK =1; 
(iii) m Monet )( Walk) = (2 _ Aa) _ 1; 
(iv) my (Mo(K)) = (Ar — Ao + LW) (A2 — Aa + LD) (As — Aa +1) for all other 


partitions. 





Proof. We apply Theorem 12.5.4 to the decomposition of By,(M2(A)) in 
Theorem 12.6.4, bearing in mind that BS) (M2 (K)) and I,(M2(K)) have 
the same module structure (see Theorem 12.4.20 and Remark 12.4.21). Let 
A = (Ai,...,;Am) be a partition of n. By Theorem 12.5.4, the multiplicity 
m(M2(K)) in the cocharacter sequence of Mz(K) is equal to the sum of 
all multiplicities k,(Mo(K)), w = (f1,.-.,@m), where Ay > pas > Asi. By 
Theorem 12.6.4, k,(Mo(K)) = 1 for w = (44, Ho, us), w # (p), p > 0, and 
pe # (19) and ky(M2(K)) = 0 in all other cases. Clearly, m),(Mo(K)) = 0 if 
As # 0 and we have to handle the case A = (Aj, A2, A3, Aa). Let us assume 
that k,(M2(K)) = 1 for all w = (41, fo, 3). For a fixed A we obtain that the 
contribution to m)(Mz(K)) is due to those y which satisfy 


fa = Ag, Ag t1,...,A15 fe = AB, AR 4+1,...,A2; wg = Ag, Ag +1,...,A3. 


Hence we obtain 





(Ay — Ag + D(A2 — As + L)(A3 — Aa +: 1) 


possibilities to choose ys and this gives the multiplicity of m)(M2(K)) in the 
general case. Now we have to subtract the contribution of the partitions pp = 
(p), p > 0, and y = (1°), because for them k,(M2(K)) = 0. For # = (p) this 
concerns A = (n) (n possibilities for p = (p), p = 1,2,...,”, to be subtracted) 
and A = (Aq, Az) (Ar — Ag +1 possibilities for u = (p), p = Az, A2 +1,..., A). 
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Similarly, = (1°) concerns A = (Ay, 17) and A = (Ay, 1°), when we subtract 
1. Easy calculations complete the proof. 


Theorem 12.6.2 together with Theorem 4.3.12 allows to calculate the 
Hilbert series of F,(Mo(K)) and the codimensions of Mo(K). 


Exercise 12.6.6 Prove the theorem of Formanek, Halpin and Li [109] that 


1 


. - _ tits 
Hi FMI 4.19) = Tay (4 TTT) 


Hint. Use that 
Hilb( F2(Mo(K)),t1,t2) = 


= aoc SoS! Siptaia) (ts #2) — SS Sip) (tas te) | 








peo g>o pol 
S(ptaa) (tr, t2) = (tate) (Gh + te + FET +) = 
peti _ peti 
= (tt.)i+ 2 
(tit2) ote 


Compare this proof with the original proof of Formanek, Halpin and Li in 


[109]. 


Remark 12.6.7 Since dimM2(K) = 4, Exercise 12.5.13 gives that the 
cocharacter sequence of M2(/) contains only irreducible characters corre- 
sponding to A = (Aj, A2,A3,A4) and the multiplicities of y,(M2(K)) are 
completely determined by the Hilbert series of F'4(M2(K)). Formanek [105] 
proved that 


Hilb(Fy(Mo(K)), ty, ta, ty, ta) T—] to titetata oy) 
1 4 2(40)), 01, 62, 63, ba) = _ 
iz (l- +)? I<; — tt;) 





4 
1 
+ TT] ay (= colt tats. ta)) 
t=1 


Another description of the Hilbert series of F,,(Mo(/‘)) is given in [215] and 
[74]. 


Theorem 12.6.8 (i) (Procesi [215]) The codimension series and the codi- 
mension sequence of Mo(K) are the following: 


o{MaC).2) =O enl MaL CE = 75 (1 ot — J/1 — 4t 
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e , 1 
(l-t)}# ° 1-t 1-28’ 


Cn (Mz(K)) = — rr) - (3) 41-2", 


(ii) (Regev, see [226]) The n-th codimension cy(M2(K)) satisfies the 
asymptotic equality 





qnt+l 


en(Ma(K)) 





Proof. (i) By Theorems 4.3.12 (ii) and 12.6.4, it is sufficient to compute the 
proper codimension series 


9(Mz(K),t) = So yn (Me(K))t* = 
k>0 


= S730 dimM (p11, #2, ps)t* — S> dimM (k)t* — dimM (1°)t? 
k>0 pbk k>1 


In order to calculate the first sum, let us consider a hypothetic Pl-algebra R 
such that 


Bm (R) = Bm/(Bm OT(R)) =) Win (P? 


As in the proof of Theorem 12.6.5, we obtain that 


= SS Wan (Ha Ha, Ha)- 


k>0 wrk 
In virtue of Theorem 4.3.12 (ii), 
(R,j=— Re d 7(R,t) = S° dimM (p 
c(R,t)= 77 | RT) an ¥( im 
p20 
Applying the hook formula of Theorem 12.2.12 (ii), we obtain 
1 2p 
dimM 
= pt+l val; ) 
and, as a consequence, 


7(R,t) = > (*”) meet (1- V1 48). 


x p+l1  2t? 





Easy calculations complete the proof, because 


1 
So dimM (k)t* + dimM (1°) = Tait e. 
k>1 
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The assertion for the codimensions is equivalent to that for the codimension 
series and (ii) follows from the Stirling formula 


nia J/2rn (=) . 
€ 


Remark 12.6.9 Using his method, Regev (see [226]) determined the asymp- 
totic behaviour of the codimension sequence of M,(K) for any k > 1 and 
showed that 


Cn(Mp(K)) & (2m) F-29012 1191 = IRF) BY)? pant? 


Exercise 12.6.10 Prove the theorem of Drensky [71] for the cocharacter 
sequence of the Lie algebra sly(A): 


Xn (sle(K)) = Sous n> I, 


where the summation runs on all partitions A = (Ay, A2, A3) F m such that 
Ay #0 for n > 1 and at least one of the integers 41 — Ag and A2 — Ag is odd. 


Hint. Use that the homogeneous components of degree n > 2 of the free Lie 
algebra Ly, are contained in BY) C K(Vn) and in the proof of Theorems 
12.6.2 and 12.6.4 we have worked with the basis {a1, a2,a3} of the vector 
space slo(K). Repeating the main steps of the proof of Theorem 12.6.4, show 
that the nonzero highest weight vectors w (#1, #2, #3) can be chosen to belong 
to Lm if one of the integers Ay — Az or Az — Ag is odd. Show that w (a1, aa, a3) 
are central elements of Mo(’) if both Ay — Az and Az —A3 are even and hence 


Ww (a1, G2, 43) € slo(K). 
Exercise 12.6.11 Calculate the Hilbert series 
Hilb(F'2(sla(K)), t1, ta) 


(the result is a theorem of Bahturin [20]). 
Hint. Apply the previous exercise in the spirit of Exercise 12.6.6. 


Exercise 12.6.12 Let [(M2(K)) be the ideal of weak polynomial identities 
of M2(K) (see Definition 7.3.6). Prove the theorem of Procesi [215] for the 
GL »(K)-module isomorphism 


K(Vin)/(K (Vin) O1(Me(K))) = S2 Win (Ar, A2, As): 


Hint. Repeat the arguments from Exercise 12.6.10 using the highest 
weight vectors 
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wy (#1, @2, 03) = 83° (#1, t,£3)[a1, ee)? ap 2, 


Remark 12.6.13 Razmyslov [217] showed that the ideal J(M2(K)) of weak 
polynomial identities for M2(K) is the smallest ideal of A(X) which is 
closed under substitutions of Lie elements and contains the polynomial 
[z?, x2]. Drensky and Koshlukov [93] proved that [(M2(K)) is the small- 
est ideal of A(X) which is closed under substitutions of linear combinations 
of the variables (i.e. is a GL(span(X))-module) and contains [27,22] and 
$4(@1,%2,%3,%4). A similar theorem over an infinite field of positive char- 
acteristic was established by Koshlukov [151]. Compare these results with 
Exercise 7.4.4 and its hint. See also [84] for relations with invariant theory of 
matrices. 


Remark 12.6.14 By the theory of Kemer (see Theorem 8.4.10), the simplest 
T-prime ideals are the commutator ideal and those of the Grassmann algebra, 
the 2 x 2 matrix algebra and the algebra M1. It is known that T(M,1) = 
T(E ® E). Popov [209] proved that 


T(E @ EB) = ([l1, €2], [a3, va], #5], [[z1, v]?, ei] 


and determined the S,-module structure of the proper multilinear polynomial 
identities of B® E: 


n(E® BE) = 5° M(p, 24,1") 


where the summation runs on all partitions (p, 22,1") F n except the trivial 
cases p= n > 0, q = r = 0 (which corresponds to the linearization of «7 
and M(n) is not contained in I’,) and (1°*+") (related with the standard 
polynomial of odd degree which is not proper again). 


Exercise 12.6.15 Calculate the cocharacter sequence of the tensor square 
E® FE of the Grassmann algebra FE. 


Hint. Use the result of Popov in Remark 12.6.14 and apply Theorem 12.5.4 
as in the proof of Theorem 12.6.5. 


Exercise 12.6.16 Let E(V2) be the Grassmann algebra of the two-dimens- 
ional vector space V2. Calculate the cocharacters and the codimensions of the 


algebra F'(V2) @ E(V2). 


Hint. This is a partial case of a result of Di Vincenzo and Drensky [64] which 
describes the proper cocharacters of F @ E(V;) and E(V;,) @ E(Wj), k,l > 2. 
Show that E(V2) © E(V2) satisfies the polynomial identities 


[1, €2,%3, €4] = [1, 2, €3][e4, &5] = [e1, eo] [v3, 24, e5] = 0 
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[@1, to] [x3, val[as, r6| =—0 


and derive that 
Bm (B(V2) @ E(V2)) = K ® Wm (17) 6 Win (2, 1) 8 Win (27) @ Win (1°). 


Another possibility is to use that E(V2) @ E(V2) satisfies all polynomial iden- 
tities of FE © EF and to apply the result of Popov in Remark 12.6.14. One 
can see that all proper polynomials of degree > 5 vanish on E(V2) @ E(V2) 
and check which proper highest weight vectors of degree 4 are identities for 
E(V2) @ E(V2). One can show even more, that the T-ideal of K(X) gener- 
ated by the polynomial [21, x2, #3, #4] coincides with the intersection of the 
T-ideals T(/) and T(E(V2) @ E(V2)). See the paper by Volichenko [262] for 


the description of ([@1, @2, #3, @4])° . 


Exercise 12.6.17 Let Cy = K[é]/(t*) be the polynomial algebra in one 
variable modulo the ideal generated by t*, k > 1. Let 


_{c te 
Ri = (ic C ) 
be the subalgebra of the 2 x 2 matrix algebra with entries from C, i.e. Ry con- 


sists of all matrices such that the “other” diagonal contains only polynomials 
without constant terms. Calculate the proper cocharacters of the algebra Rx. 


Hint. Since R;, satisfies all polynomial identities of M2(A), we obtain that 
Bm (Rx) is a homomorphic image of B,,(M2(K)). Repeating the arguments 
of the proof of Theorem 12.6.4, show that Wy» (p41, #2, #3) C Bm( Re) if and 
only if po +p3 < k. The algebra Rg plays a role in noncommutative invariant 


theory (see [82] and [148]). 


Exercise 12.6.18 Let R be the algebra of all matrices 


a 712 Y13 
0 6 ys ],0,8,%7 © K. 
0 O a 


Calculate the proper cocharacters of R. 


Hint. Working modulo the centre of R, we may assume that the noncentral 
elements of R are of the form 


Geo + y12€12 + 23623. 


By Exercise 12.5.14, the proper cocharacters are decomposed into a sum of 
irreducible S,-characters y, with A4 = 0. Use that R satisfies the polynomial 
identities 

[ay, £9\[r3, rales, r6| = [[ara, xo|[2x3, ral, £5] =0 
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and hence all polynomial identities of EB @ E. Apply Remark 12.6.14 and 
derive that 
Bm(R) C S> Wp, 24,1’). 
qtr<2 


Show that for Ax + A3 > 2 the highest weight vectors w, € By, vanish on R 
because involve more than two times the entries of e;2 and e23. Answer: 


Bm(R) =K @S> Walp 1,1) @ So (Wn(p — 2, 2) © Win(p — 2,1’). 





Exercise 12.6.19 Show that the T-ideal TR) of a Pl-algebra R does not 
contain any standard polynomial s,, n > 1, if and only if T(R) C T(E). 


Hint. Use that if TCR) is not contained in T(/), then 
K(X) /(T(E) + T(R)) 


is a proper homomorphic image of F'(/). Using the basis of P,,(#) consisting 
of multilinear elements of the form 


Big. Bion [Pj1) Cio] ---[Lion_1> Lion], 1 << --- < tne, ja <<... < jor, 


(see Theorem 4.3.11 (ii) and the proof of Theorem 5.1.2) show that every 
proper subvariety of the variety generated by the Grassmann algebra satisfies 
some standard identity. Hence (P, N T(R))/(P, A TUR) 9 T(E)) contains 
M(1”) for some n. Use that the multiplicity of W(1”) in P,, is equal to 1 and 
derive that s, € T(R). 


Exercise 12.6.20 Prove for unitary algebras Theorem 8.2.1 of Kemer [139] 
describing the Pl-algebras with polynomial growth of the codimension se- 
quence. If the algebra R satisfies polynomial identities f/f; = fo = 0 such that 
fi € T(E), fo € T(U2(K)), then the growth of c,(R) is polynomial. (The 


inverse part of the theorem was handled in Exercise 8.2.2.) 


Hint. Let the unitary algebra R satisfy polynomial identities f,; = fo = 0 
such that fi; ¢@ T(E), fo € T(U2(K)). By Exercise 12.6.19, R satisfies some 
standard identity and by Theorem 8.4.5 of Kemer, some Capelli identity. 
Hence (Exercises 12.5.12 and 12.4.16) T(R) = T(P,(R)) for some m. The 
algebra F(R) is finitely generated and satisfies the nonmatrix polynomial 
identity f2 = 0. By the theorem of Latyshev in Remark 5.2.2, F(R) sat- 
isfies [r1, @9]...[%24-1, 24] = 0 for some k. Using arguments as in the hint 
to Exercise 12.6.19, K(X)/(T(R) + T(U2(K))) satisfies the Engel identity 
x ad’ x, = 0 for some p. Derive from here that R itself satisfies the Engel 
identity, because the multiplicity of Wn(p,1) in By, is equal to 1. Obtain 
that the Engel identity together with [a1, 9] ...[ae4-1, 24] = 0 implies that 
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BY (R) = 0 for n large enough. (The number of the commutators in the 
products of commutators in BY (R) is bounded by k—1. The length of each 
commutator x; [];"., ad" 2; is bounded by 1 + (p— 1)m, otherwise it follows 


from the Engel identity). Since BY) (R) = 0 for big n’s, then Theorem 4.3.12 
(ii) gives the polynomial growth of the codimensions. 


Test 


As we have mentioned in the preface, the book is based on a graduate course 
given at the University of Hong Kong. We give the final test for the course. 
The students had one week to work on 12 problems. Any 6 problems solved 
were sufficient for “A”. Since the level and the purposes of the students were 
different and not all of them were graduate students working on algebra, the 
level of difficulty of the 12 problems is also different. 


1. Let H be the R-algebra with basis {1,7, j,k} and multiplication 








Papak=-1,ij=—ji=k, jk = —kj =i, ki=—ik =j. 


(i) Show that HI is an associative algebra. 

(ii) Calculate (a + 6i + yj +. dk)(a — Bi — yj — bk), a, 8, 7,6 ER. 

(iii) Show that every nonzero element of HI is invertible, i.e. HI is a non- 
commutative “field” . 





2. Let ay = €11 —€22, do = €19 +€21, G3 = €12 —€21 be elements of the algebra 
of 2x2 matrices with entries from a field K. Let a;,...a;, be a product which 
contains n; times a1, nz times ag, ng times a3, Ny +nN2+n3 = n. Let ¢; = 0,1, 
€; =n; (mod 2), i= 1, 2,3. Show that 


aj, ..-d;, = aa;'as7a;°, a € K. 


1 n 


3. Show that the three-dimensional Lie algebra C® with basis {i,j,k} with 
the usual vector multiplication 


i jk 
(ait Bij t+ yk) x (aei+ Gjt+yek)=loar fo oy 
az 92 2 


is isomorphic to the Lie algebra sl2(C) of all traceless 2 x 2 matrices with 
entries from C. 


4. Let G be the two-dimensional Lie algebra with basis {a, 6} and multiplica- 
tion [a,b] = a. Present in a canonical form (as in the Poincaré-Birkhoff- Witt 
theorem) the element f = baabab of the universal enveloping algebra U(G). 
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5. Calculate the codimensions of the polynomial identities of the algebra of 
2x 2 upper triangular matrices over a field of characteristic 0. 


6. Show that over a field of characteristic 0, the polynomial identity 


([kx, y).[x, aI]. 2] =0 


is a consequence of the standard identity s4(a1, %2, #3, #4) = 0. 


7. Let C, = K[t]/(t*) be the polynomial algebra in one variable modulo the 
ideal generated by ¢*, k > 1. Let 


Cc tc 
Ri = (ic C ) 
be the subalgebra of the 2 x 2 matrix algebra with entries from C, i.e. Ry con- 


sists of all matrices such that the “other” diagonal contains only polynomials 
without constant terms. Show that R, satisfies the polynomial identity 


[ay, r9| tae [@on-1, LK] = 0. 


8. Let chark = 0 and let k EN. 


(i) Show that in the free associative algebra without 1 the elements yz*, 


k k 


a*z and ya*z are linear combinations of some uf,..., uX. 
(ii) Show that in the free nonunitary associative algebra K(X) the T-ideal 
generated by x* coincides with the vector space spanned by all u*, u € K(X). 


9. Let ao, a1, a@2,... be the sequence defined by 








ao = 0, dg = 1, a2 = 2, An 43 = 3an41 — 2an. 
Find a closed formula for a, (i.e. a formula which expresses a, in terms of n 


only and does not involve the previous elements of the sequence). 


10. Let U, be the algebra of k x k upper triangular matrices with entries from 
an infinite field. Calculate the Gelfand-Kirillov dimension of the relatively free 
algebra 


Fry (Up) = K(ai,...,€m)/(T(UR) OK (a1,...,em)), m> 1. 


11. Let 


y= (x —2y(y? + @z)— 2(y? +a2)*, yt 2(y? +22), 2,0) 





be the Nagata automorphism acting on the polynomial algebra K[2, y, z, ¢]. 
Decompose vy as a product of linear and triangular automorphisms. Write 
explicitly the action of these automorphisms on the generators 2, y, z,t. 
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12. Show that the following endomorphism of K(«, y, z) is an automorphism 
and find its inverse 


b= (x+y(xy — yz), vy, 2+ (zy — yz)y). 


Hints 


1. The algebra HI is the quaternion algebra (compare with the quaternion 
group in Exercise 12.1.19). 

(i) Verify the associative law on the basis vectors 4, j, k. 

(ii) Answer: 


(a + Bi +77 + 6k)(a — Pi-— yj — 6k) = 07 + BP + 7° 48°. 





(iii) Answer: 
a— Bi-—yj — bk 


7 j+ 6k)7} = ———__* 
(a+ Bi + yj + Ok) mPa ae 


(a, 3, 7,6) # (0,0, 0,0). 


2. This is Exercise 12.6.1. (Chapter 12 was not included in the originally 
given course.) 


3. Define the isomorphism ¢ : C? > sl2(C) by 
#(i) = ada), é(j) = Gas, #(k) = 743, 


where a , @2,@3 are as in Problem 2 and a, 3,7 € C are chosen in such a way 
that 


o(k) = $(1 x j) = [¢0), 6G)] = 2aGaj ay = 2ya3, 
similarly for i= j x k and j =k x i. Use Remark 12.6.3. 


4. Use that in U(G) 


[a,b] =a, ba=ab—a 


and express f as > aga" b!. 


5. This is a partial case of Exercise 5.2.7. Use that the exponential codimen- 
sion series is 


@(U2(K),t) = il — ((t— Vet + 1)?). 


Another approach is to use directly that y,(U2(K)) =n— 1, n> 0, and 


nm 


en(Uat)) => ("Jap (U2). 


p=0 
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Answer: ¢,(U2(K)) = 24 2"~!(n-2),n> 1. 

6. Calculate s4(21, 22, 13, 27). 

7. Show that [Ry, Ry] C #R_. Compare this problem with Exercise 12.6.17. 


8. (i) The partial linearizations of a polynomial identity f(x1,...,a@m) are 
obtained by Vandermonde arguments and, therefore, are linear combinations 
of f(ur,...,Um), us € K(X) (1 ¢ K(X)). Hence the partial linearization 


k-1 k-1 


f(a, y) = akly + ak 2ye +. + eye®? + ye 


is a linear combination of u#, u; € K(X). We consider fx(z +27, y), take the 
homogeneous component of degree k + 1 (Vandermonde arguments again) 
and obtain 


(a*y + ya") + (k — 2)(a*y tak lye +... + eye®! + ya*) = 
= (wy + ye) + (k — 2) fe (w, y)- 


k+l is the homogeneous component of degree k + 1 of (x + x7)*, we 


Since kx 
obtain that fx41(x, y) is also a linear combination of some u*. Finally, since 
ady is a derivation, we see that 


e*y — yx* = [x* y] = eS ady = f(x, [x, y]), 


ie. x*ytyz* and 2*y—yzxr* are both linear combinations of uf and the same 
holds for yx* (similarly for 2*z). Now 


ya 2 = (ye*)z = So asus z = So ai S > Bij vf. 


(ii) This is a precised version of Nagata-Higman Theorem 8.3.2 (see the 
proof of Higman [127]). Since 


(xk)? = {> Uys w; | up, Vg, We E K(x)} ; 

and yx*z = \~ ag) (a, ¥, 2), 9; © K(X), we obtain that 
wut ws = S— ongh (vi, Ur, wi), 

and this completes the proof. 
9. Use Exercise 6.1.3 or prove directly by induction that a, =n. 
10. This is Exercise 9.4.5. 
11. Follow the proofs of Theorem 10.4.8 and Corollary 10.4.9. 
12. Show that ¢(ay — yz) = ay — yz. Answer: 


o-' = (@ — (zy — yz), y,2 — (wy — yz)y). 
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